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PREFACE 
TO THE 2009 
REPRINTING 


Population genetics has undergone enormous changes since this 
book was written. It has become broader, deeper, and more rigorous. 
Computers have brought major advances. Some parts of the book are in 
error or at least not as well done as they could have been; many more 
have been superseded. A proper revision would be an enormous task; so 
it is better, I think, to leave the book as it was originally than to 
undertake what, at best, would be a makeshift job. I have, however, 
made a few minor corrections. 

This book has been out of print for some years and there has been a 
strong demand for copies. There are now several good books on 
population genetics, so is there stil! a place for this one? I believe there 
is, for some items are unique. It also, I believe, has a place as a historica! 
document. So I am grateful that The Blackburn Press has taken on the 
task of reprinting it. 

Motoo Kimura died on November 13, 1994, his seventieth birthday 
anniversary. We had several times discussed producing a revised edition 
of this book, but circumstances always intervened. For Motoo, his 
neutral theory became almost an obsession and occupied a major part of 
his time during his last 25 years. At the same time I was busy with a 
number of teaching, research, and administrative commitments. And 
there was the isolating influence of the Pacific Ocean. As a result, we 
never got around to doing a revision. 

This reprinting provides me with an opportunity to honor Kimura — 
my student and collaborator, outstanding scientist, and close friend. 


JFC 
November 2008 
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undergraduates in genetics and population biology. We hope that it 

willbe of value and interest to others also. [t is an attempt to present 
the field of population genetics, starting with elementary concepts and 
leading the reader well into the field. At first we intended to include 
experimental work; but this has been largely omitted, partly in the interest 
of coherence and partly because the book is already long. 

The first two-thirds of the book do not require advanced mathematical 
background. An ordinary knowledge of the calculus will suffice. For thereader 
who is not familiar with the mathematical and statistical procedures employed, 
we have added an appendix. The latter parts of the book, which deal with 
populations stochastically, use more advanced methods. We have made no 
attempt to explain all of these, either in the text or in the appendix. The reader 
with only elementary knowledge will have to accept some of the conclusions 
on faith. We have tried, however, to present the model and the conclusion, 


T: book is intended primarily for graduate students and advanced 
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leaving it to the reader to follow as much of the intermediate mathematical 
manipulation as he wishes. We have also added a number of tables and graphs 
in these chapters so that the major conclusions are available in this form. 

There are problems at the end of the first seven chapters. These are more 
numerous in the early chapters where the reader is more likely to wish for a 
means to test his knowledge. 

The bibliography is longer than is customary. There are many more 
articles listed than are referred to. We hope in this manner to provide a list 
which is a useful guide to the literature and which shows the richness and 
diversity of research in this field. 


Without any implication that they share any responsibility for the choice 
of content or for errors (although they are specifically responsible for the 
removal of a number of errors), we should like to thank the many people 
who have helped in various ways. We have benefited greatly from critical 
comments by Joseph Felsenstein. Others who have helped by suggesting 
alternative ways of presentation and by pointing out errors are Takeo 
Maruyama, Laurence Resseguie, Daniel Hartl, Carter Denniston, Thomas 
Wolfe, Etan Markowitz, and Tomoko Ohta. We should also like to thank 
the many students who have used in class the notes that were the forerunner 
of this book; they have been especially helpful in pointing out ambiguities. 

We should like to thank both our institutions, the University of Wisconsin 
and the National Institute of Genetics in Japan, for leaves that permitted us to 
work together on several occasions either in the United States or in Japan. 
TheRockefeller Foundation generously provided the initial support to get us 
started. 

Finally, our indebtedness to Professor Sewall Wright will be apparent in 
every chapter. In many, many instances it is his pioneering work that gave us 
something to write about. A number of topics that we have treated only 
lightly, or not at all, are included in his four-volume treatise, '* Evolution and 
the Genetics of Populations. ”’ 


J.F.C. 
M.K. 
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INTRODUCTION 


e are concerned in this book mainly with population genetics in a 
strict sense. We deal primarily with natural populations and less 
fully with the rather similar problems that arise in breeding live- 
stock and cultivated plants. The latter subject, sometimes called quantitative or 
biometrical genetics, emphasizes economically important measurements where 
the breeding system is under human control. Although this is not neglected, 
we emphasize more the behavior of genes and population attributes under 
natural selection where the most important measure is Darwinian fitness. 
As do most sciences, population genetics includes both observations and 
theory. The observations come both from studies on natural populations and 
from laboratory experiments. Sometimes the observations are verifications of 
existing theory, sometimes they are tests to distinguish among alternative 
theories, or they may lead to totally new ideas. Our emphasis is on the theory, 
but we shall occasionally make use of experimental or observational data, 
usually for illustration. 


2 INTRODUCTION 


The theory of population genetics is largely mathematical. By a biologist's 
standards it is highly developed, although a theoretical physicist might well 
regard it as rather primitive. A mathematical theory that could take into ac- 
count all the relevant phenomena of even the simplest population would be 
impossibly complex. Therefore it is absolutely necessary to make simplifying 
assumptions. To a large extent the success or failure of a theory is determined 
by the choice of assumptions— by the extent to which the model accounts for 
important facts, ignores trivia, and suggests new basic concepts. The general 
pattern of this book will be to start with the simplest models and then to 
extend these to more complicated, but more realistic, formulations. 

We have tried to steer a middle course between completely verbal 
biological arguments and the rigor of the mathematician. We have not hesi- 
tated to appeal to the reader's biological intuition and we often have used 
derivations and proofs that fail to take into account all the mathematical 
possibilities when these can fairly clearly be ruled out on biological grounds. 
Furthermore, we have sometimes used models which are somewhat vague, 
but which seem to us to have considerable biological interest and generality. 
We frequently use approximations rather than exact expressions, since we 
are more interested in finding an approximate solution to a model that seems 
tobe biologically interesting than an exact solution to one that is less interest- 
ing or realistic. When a choice is necessary we prefer generality and realism 
to precision and rigor. 

We continue the tradition of Sewall Wright and R. A. Fisher in using 
heuristic arguments that are not rigorously proven, especially in the use of 
continuous approximations and diffusion models. This involves a risk of 
later being shown to be wrong, but the history of the physical sciences is on 
the side of such a strategy. The view is well expressed by Richard Feynman 
in his Nobel Prize lecture (Science 153: 699, 1966): 


In the face of the lack of direct mathematical demonstration one must be careful 
and thorough to make surc of the point, and one should make a perpetual 
attempt to demonstrate as much of the formula as possible. Nevertheless, a 
very great deal more truth can become known than can be proven. 


mns: 
POPULATION 
GROWTH 


heoretical population genetics is concerned with model building. 

Any model of nature is an oversimplification, as is any verbal descrip- 

tion of a natural process. The model is an attempt to abstract from 
nature some significant aspect of the true situation. 

The models employed in population genetics are mathematical. The 
model is always unsatisfactory in some respects. Inevitably, it is unable to 
reflect all the complexities of the true situation. On the other hand, it is 
usually true that the more closely the model is made to conform to nature 
the more unmanageable it becomes from the mathematical standpoint. If it 
is as complex as the true situation, it is not a model. We have to choose some 
sort of compromise between a model that is so crude as to be unrealistic or 
misleading and one that is incomprehensible or too complex to handle. Those 
men who have laid the mathematical foundation for the theory of population 
genetics —J. B. S. Haldane, R. A. Fisher, and Sewall Wright—have had the 
capacity of inventing mathematical models that extracted the essence of the 
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situation in a formulation that could be handled mathematically. With in- 
creased mathematical sophistication, a more comprehensive and rigorous 
theory can be developed and much of the current research in theoretical 
population genetics is concerned with such developments. 

Just as the economist considers the broad consequences of individual 
transactions in a more or less free market, the population geneticist is inter- 
ested in the overall consequences of a large number of events— births, deaths, 
choice of mates, and the host of individual circumstances, habits, decisions, 
and accidents that determine these. As the physicist or chemist works with 
the statistical averages of molecular behavior and does not try to describe 
the behavior of each individual molecule, the population geneticist tries to 
describe the overall effect of a large number of individual events. 

It is convenient to divide mathematical models of population structure 
into two kinds, deterministic and stochastic. With a deterministic model, 
the population is assumed to be large enough and the factors determining 
individual birth rates and death rates constant enough that the consequences 
of random fluctuations can be ignored. This would be true only for an infinite 
population under highly idealized conditions; but actually many populations 
are large enough that the ** noise" introduced into the system by random 
processes is small enough in relation to systematic factors that, for the degree 
of approximation needed, it may be ignored. Deterministic models are much 
easier to handle mathematically, as wil! be abundantly clear throughout 
the book. 

Stochastic models take account of the effects of the finiteness of the popu- 
lation and other random elements. Some populations are small enough or 
the conditions are variable enough that random fluctuations are appreciable. 
The difficulty is with the mathematical complexity. Only the simplest situa- 
tions have exact solutions. However, it was the genius of R. A. Fisher (1930, 
1958) and Sewall Wright (1931, 1945, 1960) to devise procedures that provided 
very accurate approximations which have led to deep biological insights. 
Recently, more sophisticated mathematical techniques and electronic com- 
puters have been used to give more exact and extensive results. 

In this chapter we describe briefly some of the deterministic models 
that have been used in population genetics. Genetics is ordinarily concerned 
more with the relative frequencies of different genes and genotypes in a 
population than with the size of the total population. Nevertheless, in this 
chapter we shall consider the population as a whole. The purpose is to intro- 
duce the models rather than make use of them in genetic analysis; they will 
be used later. 

We introduce four deterministic models: 


]. Discrete, nonoverlapping generations, 
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2. Continuous random births and deaths, 
3. Overlapping generations, discrete time intervals, 
4. Overlapping generations, continuous change. 


In this book we shall make use of only the first two for genetic problems. 
Thus far the more realistic models 3 and 4 have been used mainly by ecologists 
and demographers. But the increasing mutuality of interest of population 
geneticists, demographers, and ecologists forecasts a greater emphasis on 
these models in population genetics. 


1.1 Model 1: Discrete, Nonoverlapping Generations 


This is in many ways the simplest description of population growth. We 
assume that the parent generation reproduces and that, before the offspring 
reach reproductive age, the parents have all died (or at least are no longer 
counted). Time is measured most conveniently in units of generations. 

This model is a realistic description of some populations, such as annual 
plants. For others it may still be a useful first approximation. It is widely 
used because of its mathematical simplicity. 

Let N, be the number of individuals in the population at time t, measured 
in generations. Jf the average number of progeny per individual is w, the 
population number in generation t can be expressed in terms of the number in 
the previous generation, t — 1, by 


N, = wN,.,. 1.1.1 


We regard w as a measure of both survival and reproduction; individuals 
who do not survive to the reproductive age are counted as leaving 0 progeny. 
We call w the Darwinian fitness, or simply the fitness. Each generation must 
be counted at the same age. It is often convenient to count the population 
as zygotes, so that the survival and reproduction of an individual occur 
within the same generation. 

The relation between N,. , and N,., is the same as that between N, 
and N,. ,. Therefore, if w remains constant 


N, = w(wN,-2) = w?N,-2 . 
Continuing this process, N, = w?N,. ,, and finally 
N, = Now, 1.1.2 


where Nọ is the number in the population in generation 0. 
The change in population size is analogous to money invested at com- 
pound interest. lf w = 1 + s, then sis equivalent to the interest rate. Jf w > ], 
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or s >Q, the population is increasing; alternatively, it is decreasing if w < 1 
ors <0. 
Equation |.].1 may be written 


AN, =SN,, 1.1.3 


where AN, is N,,, — N,. This shows that the proportion by which the popula- 
tion changes in one generation, AN,/N,, is given by s. 

If the population is composed of several kinds of individuals (perhaps 
several different genotypes) with different fitnesses, then the whole population 
is increasing or decreasing at the same rate as if all the individuals were of 
average fitness, the appropriate average being the weighted arithmetic mean. 
This is easily demonstrated as follows: 

Let ny, 72, ..., n, be the numbers of each of k types in the population 
in generation r — l, and let the fitnesses of these types be w,, w.,..., Wp- 
Assume that n +, -::: c n, =N and let, 

—  RW,cnwiccs +n W, nw; 

TU UD ea ee No 
Next generation there will be win, of type 1, wan, of type 2, and so on. 
Thus, letting w = Zn wj/ZXn;, 


N, = Wyn, wing coc + wn, = Ewn; 


= WN, -1- 


1.1.4 


If the fitnesses of the different types are inherited, those with greater 
fitness will be represented in greater proportion in the next generation. 
Hence w may be expected to change from generation to generation. In Chapter 
5 the amount of this change will be discussed. 

A related question is this: What happens when W is not constant from 
generation to generation? In this case, let wi, w,,..., Ww, be the value of w 
in the different generations. Then 


= wW — uw iw i — Wu uw o MU 
N, = Ww, AN, = We-i 1-2 Ni-3 = wW-1W,-2 Wo Ng 


t-i 


No [[ w; 11.5 
0 


= No Gr, 


where G,, is the geometric mean of the w’s (see A.1.2). 

To summarize: With a discrete generation model and variable fitnesses, 
the weighted arithmetic mean is appropriate for contemporary differences, 
while the geometric mean is appropriate for averaging over different 
generations. 
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1.2 Model 2: Continuous Random Births and Deaths 


In this model we regard the population number N, at timet as being very large. 
In an infinitesimal time interval At a fraction bAt of the population produce 
an offspring and a fraction dAt die. Thus the change in population number 
during this interval is 


AN, — (b — d)N, At 





1.2.1 
= mN, At, 
where m = b — d. 
As At 0, this becomes 
dN, N 
= mN,. 2. 
7 i 1.2.2 
In integrated form, mt = log, N, — log, No, or 
N,= Noe™. 1.2.3 


Time can be measured in any convenient units—hours, days, weeks, or 
years; in any case m is measured in the reciprocals of the same units (per hour, 
per day, etc.). 

In its strictest form this model applies only to such situations as bacterial 
growth in an unrestricted environment. Each individual is regarded as 
equally likely to die or reproduce in any instant. On the other hand, most 
populations are neither decreasing nor increasing very rapidly. The age 
distribution is often very near to equilibrium so that average birth and death 
rates are nearly constant. Under these circumstances the model may bea 
very good approximation. This is especially true if we are interested in changes 
in proportions of different types and such changes are slow compared with the 
average life span of an individual. 

Comparison of 1.1.2 with 1.2.3 shows the correspondence between w 
and m. If w = e", or m = log, w, the formulae are equivalent. Therefore if 
the time unit is chosen so that m — log, w, the continuous population changes 
such that, if counted at time intervals correspondingto these units, the numbers 
correspond to the discrete population. Thus measuring time in such units 
makes t in this sense equivalent to the number of generations in a discrete 
population. 

If w is nearly 1, and therefore s is very small, then log, (1 + s) is very 
nearly s. Thus for very slow change s and m are very similar. 

The comparison of the two systems may be made clearer, perhaps, with 
a concrete example. Consider first a discrete population that reproduces 
once each year, each parent giving rise to two progeny and dying immediately 
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after (or at least not being counted in the ensuing generation). Thus w — 2. 
The population doubles each year; N, = 2N,. , and N, = No2', 

A second population is growing continuously at such a rate as to double 
in a year's time. N, = N42'. From 1.2.3, e" 22, or m = log, 2 = .693. Thus, 
N, = No 69s 

A third population is also growing continuously, but has a growth rate 
equivalent to 100% interest compounded continuously. That is to say, 
m=1 and dN,/dt = N,. Integrating, N, = Noe‘; each year the population 
increases by a factor e = 2.71. 

More generally, the relations of the w’s, s’s, and m’s of the three popula- 
tions are: 


Population |: w=] +s, 
Population 2: m = log, w = log.(l + 5), 
Population 3: m - s. 


Figure 1.2.1 shows these relationships graphically. 


8 Mo 1 





6 No 
N 
4 No 
2 Mo 
No 
[] 01 a — — T — 
(0) 1 2 3 4 


Figure 1.2.1. Growth of three populations. In 
population 1 there are discrete generations with 
growth rate w = 1 + s, such that N, = Now’. In 
population 2 growth is continuous such that 

N, = Noe™ and m = log. w. In population 3 growth 
is continuous such that N, = Noe™ and m = s. The 
graphs are drawn for s — I. 
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We shall call w the fitness, or the Wrightian fitness, after Sewall Wright, 
R. A. Fisher (1930, 1958) designated m as the Malthusian parameter. 
Although used here as measures of the absolute rate of increase, w and m 
are also used for the relative rates of different types (see Chapter 5). 

Notice that if there are no births we can write 1.2.1 as 


dN, = aN, dt, 
which, on integration, is 

N, = Noe. 
(The death rate is written with a caret over the d to distinguish it from the 
derivative sign.) Hence the life expectancy curve is exponential with each 
survivor having a constant probability of surviving for another time unit. 


If there are different fitnesses of the different types in the population, 
1.2.2 becomes 


aN 
ae. myn, + M,N, + T + mm, ny 
1.2.4 
= mN, 
where 
_ inm, nm; 
n= — = 





Yn, N 


Hence, as with w in the discrete model, it is appropriate to replace m with 
the weighted average of the m’s in the population. 

When m varies in time we can think of time as broken into k intervals 
of length Af during each of which ñ is different. Then 1.2.3 becomes 


N, = No(e™"4")(e724"2) 5. ots (e^!) 





= No eM Ary + m2Atz +++ + mEATK 1.2.5 
=No e^", 
where 
m- XAnm, Atm; 


EAt, t 


Thus, in contrast to w in the discrete model, with a continuous model 
the appropriate average is the arithmetic mean of the m’s. This is reasonable, 
of course, since if m = log w, the arithmetic mean of m is the log of the geo- 
metric mean of w. 

As mentioned earlier, if there are different types in the population with 
different m’s and if the fitnesses are heritable, then the average value of m 
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will change from generation to generation as the fitter types make a dis- 
proportionate contribution to future generations. This is another way of 
saying that natural selection will generally lead to an increase in the average 
fitness. 

Our intuition tells us that the amount by which the average fitness changes 
per unit time will be related to the variability in fitness among the different 
types in the population. We shall now show that the rate of increase is given 
by the mean squared deviation or the variance. 

Consider, as before, that there are k types with frequencies n,, 75, ..., n, 
and fitnesses mı, m;,..., m, measured in Malthusian parameters. Assume 
that the fitness in each type is perfectly heritable and that the populations do 
not interbreed. This would be appropriate to separate species, or to asexually 
propagating clones. The formulae would also apply to a single locus in a 
haploid population, to a plasmon, or to competition between self-fertilized 
strains 

We wish to know how m changes with time. From the definition of m, 
given by 1.2.4, 


dm H d [x 
dt dt| N J 


The m,’s are constant, but the »,’s are variable; therefore the weighted 
average m and N = xn, are also variable. From the rules for differentiation, 





1.2.6 


T 

dm N£m, m — (£m; n;) zd 

dt N? 
But, from 1.2.4, 

dn; 

Pr m;n; 
and 

dN _ 

p = MN. 


Substituting these and recalling that 2;n; = NM, we obtain 


dm Xnjmi- Nim? 
URDU TTA TTD Pe TEE TU ee V, , 
dt N 


where V,,, is the variance of the m;'s (see A.2.4). 
We have just demonstrated a rather special example of R. A. Fisher's 
“ Fundamental Theorem of Natural Selection," which he presented in 1930. 


It says that the rate of change in fitness at any instant, measured in Malthusian 


1.2.7 
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parameters, is equal to the variance in fitness at that time. The variance in 
this case is the variance in fitness among the types. In a Mendelian population 
with different genotypes there will be crossing among the types. As expected, 
the rate of change in fitness is then determined not by the total variance, but 
by the part of the variance that is associated with transmissible gene differ- 
ences. The way in which this variance may be computed will be shown in 
Chapter 5. 


1.3 Model 3: Overlapping Generations, 
Discrete Time Intervals 


We now consider a more complicated model. The time is divided into discrete 
units, which are usually short relative to the life span. The model is especially 
appropriate for species, such as many birds and mammals, which have a 
specific breeding season, but which may survive for several such seasons. 
It is also a good approximation for organisms, such as man, where the popu- 
lation, although changing continuously, is censused at discrete intervals. 
In man, time would ordinarily be measured in years or in 5-year intervals. 
We follow closely the procedures of Leslie (1945, 1948) as given by Moran 
(1962, p. 7). 

Let n,, be the number of individuals of age x at time t. More precisely, 
“age x" means “in the age interval x to x + 1" as in ordinary discourse. 
In bisexual forms it is convenient to adopt some convention in regard to the 
sexes; one such is to count only the females. Let p, be the probability of 
surviving from age x to age x + 1. Likewise, let b, be the average number of 
progeny produced by an individual of age x, counting only those progeny 
that survive long enough to be counted in the next time interval. To be 
concrete, we shall discuss the model as if time were measured in years. 

For convenience we shall assume that no individual lives more than 5 
years, although the procedure is obviously capable of being extended for any 
number of time intervals. Starting from t = 0, the number of individuals 
of age 0 at time 1 will be the number born to parents of age 0, ngo bo, plus 
those born to parents of age 1, n9 b,, and so on. In turn, the number of age 1 
will be the number who were age 0 at time 0 multiplied by the probability 
of surviving from age 0 to age 1, noo Po, and so on for the other ages. Writing 
these as recurrence relations, we obtain 


noi = "gobo + mobi + 12952 + ngo b3 + N40 ba, 
M1 = "oo Do; 
N21 — opi, 
N31 ="20P2, 


N30P3> 


Nay 
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and 
n, = O. 1.3.1 
From these equations the composition of the population a year later 
can be worked out. For example, 
no; = Mo; bo + MB, + nj b; + Naiba + naiba, 
"2=NoiPos 
N22="1P1, 
n3) = M1 P2, 
N42 — M31 P3, 
and so on for later years. 


Table 1.3.1 gives data on the composition of a population in successive 
years, starting with the highly artificial situation of 10,000 individuals of 


Table 1.3.1. The age composition of a population starting with 10,000 new born. 
The survival probabilities at age 0, 1, 2, 3, and 4 are .8, .9, .9, .7, and 0; the birth 
rates for the same ages are 0, .3, .5, .5, and .2. A stable age distribution and a con- 
stant rate of population growth are attained in about 20-25 units of time. 


PROPORTION OF AGE 





TIME POPULATION N, 
eo ai 142 2-3 344 qus. NUMBERS Nis 
0 1.000 0 0 0 0 10,000 
| .000 1.000 .000 .000 .000 8,000 .800 
2 .250 .000 .750 .000 .000 9,600 1.200 
3 .300 .160 .000 .540 .000 12,000 1.250 
4 .294 .222 .133 .000 .350 12,960 1.080 
5 .268 .310 .264 .158 .000 9,835 .759 
6 .265 .187 -244 .207 .097 11,267 1.146 
7 .288 .203 .161 .210 .139 11,785 1.046 
8 .280 .235 .187 .148 .151 11,532 .979 
9 .275 .230 .217 .172 .106 11,247 .975 
10 .277 .214 .201 .190 .17 11,553 1.027 
12 .278 .225 .198 .172 .126 11,627 .995 
14 .278 .220 .198 .180 .124 11,763 1.0092 
16 .278 .222 .199 177 .124 11,847 1.0023 
18 .278 .221 .198 .178 .124 11,967 1.0054 
20 .278 222 .199 .178 .124 12,067 1.0041 
25 .2782 .2222 .1985 .1779 .1239 12,343 1.0045 


Limit .2782 | .2222 .1985 .1778 .1239 oo 1.0045 
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age 0. The survival probabilities are taken to be pọ —.8, p, = .9, p; = 9, 
Pa =.7, P4 = 0. The birth rates are bj = 0, b, = .3, b, —.5,b4 = .5, and b, = 
.2. The information is shown graphically in Figure 1.3.1. Note that the popul- 
ation age distribution fluctuates for several generations, and then reaches a 
constant proportion for each age. At this time the population is increasing 
at a constant rate of 0.45% per year. 


12000 = 
Eo 
D 4 
ba 
- DN i IAMA B HM M mM M 
in: 7REBRBRBERBRBRBBRBRRBE ES: 
KS R K e eio p k R eee ee OR OR 
BAe eee Oe Ae ee Aa 
IAB RUM ABB AU aU eee AA 
Anuapnaadaanaadandadudaadun 
"^ VA JA A 4 Z A 
2/mnnuunocadudnudaddadndd 
6000 A B A At Au 
À 
A 
J | 
4000 A E 
A 
KA 
2000 ll 0 
0 
0 2 3 4 5 6 7 8 9 0 n 1% 13 14 15 16 Y 18 





Figure 1.3.1. The total number and age distribution of a population starting 
with 10,000 individuals of age 0. The survival probabilities at the 5 ages are 
Po = .8, Pi = 9, p; = -9, Ps = -7, Pa =O. The birth rates at the corresponding 
ages are bo = 0, b, = .3, b; =.5, b = .S and b, =.2. After wild fluctuations 

in the first few generations the population settles to a constant age distribution 
and a rate of increase of 0.45 7; per unit time. The total population scale is on 
the left ordinate, the age at the right. The number of time units is given along 
the bottom. 


The example illustrates the general principle that any population with 
a fixed schedule of age-specific birth and death rates eventually reaches a 
characteristic age distribution which remains stable. When this stage is 
reached the growth rate is constant and equations 1.1.1 and 1.1.2 are appli- 
cable. 

We can carry out a more complete analysis, including a demonstration 
that stability of age distribution is eventually attained, by using matrix 
notation. The necessary techniques are given in Sections 7 and 8 of the Appen- 
dix. 


14 AN INTRODUCTION TO POPULATION GENETICS THEORY 


In matrix form, the relations 1.3.1 become 


"o, bo b, bi by b4YV[noo 
ny Po 0 0 0 Oj|[nmo 
i = 0 Pi 0 0 0 N20 1.3.2 
n3, 0 0 p 0 Oj[7o 
Nay 0 0 O p, Of ing 
or, in abbreviated notation, 
Ni, = MN, , 
from which 
N, — M'N,. 1.3.3 


Thus the composition of any future year can be obtained by successive 
multiplication of matrices. 
The characteristic equation (see A.8) is 


IM — AI| = 0, 1.3.4 
which on expansion gives 


25 -— bo A* — Po b,2? — popibi a - PoPiPib3À — Po PiP2P3b4 = 9. 
1.3.5 


We now let 2, stand for the positive real root of this equation. From 
the fact that all coefficients after the first are negative we know that there is 
one and only one positive real root. lt is also the largest in absolute value 
(for a demonstration of this see Moran, 1962, p. 8). It can be found from 1.3.5 
by trial and error. 

Consider now the equation 


MI = Al, 1.3.6 


where M is the original matrix and / is a column vector (corresponding to 
/, m, n in equation A.8.3). The solutions for the ls can be gotten, after 7, 
is determined, from 1.3.6. We assume that the roots are unequal. 


When t is large 74, is much larger than 25, 74,..., so that we can write 
(see A.8.9) 

no, = Ch AT 

n, = Ch | approximately, 1.3.7 

na, = Cla 


where C is a constant determined by the initial conditions. 


MODELS OF POPULATION GROWTH 15 


The population thus has a stationary age distribution with individuals 
of age 0, 1, 2, 3, and 4 in the ratio /, : /, :/; : /4 : l4. Notice that these ratios 
do not depend on the initial age distribution. 

To solve for the /'s we write 1.3.6 in the form 


lg Ay = lo bo + lb, + Lb, + h b, + l, b4, 

hà, = lopo, 

hà, = lPi, 

là, = lı pi, 

l, h = hPa. 

From the second equation, /, = /gpo/A,. From the third, /, = /4,p,/4, = 
lo Pop;/å}. Continuing this pattern, we obtain the solutions 

lo = lg , 

I, = lo PoAr', 

Ly = lo po pii ^, 1.3.8 

l5 = lo Po Pı P24» 

l, = lo poPi P2P341* 

The /'s give the ratios of the numbers in the different age groups: 
be converted into proportions by dividing by the total number. 

Notice that when A, = I, the stable age distribution is determined entirely 
by the survival rates. For example, the number of age 4 is proportional to 
PoP: P2P3, Which is the probability of survival to this age. 

We now inquire into the biological meaning of /,, the largest root. 
When the age distribution is stabilized, then all age groups change at the 
same rate. By summing 1.3.7 we obtain 

En, = C(Xlj)A,. 

But, when t=0, Zn;; = No, the initial population number. Therefore 
CXl; = No, and we can write 
N, = No À, 1.3.9 


for a population with a stable age distribution. 

Comparison with equation 1.1.2 reveals the meaning of 4,. This corre- 
sponds to w, the population fitness. Thus, when a stable age distribution is 
attained, the whole population (and each age group within it) changes by a 
factor A, in each time unit. 

We have considered only five age classes. The generalization to any 
number of classes is obvious. A more general and rigorous discussion of this 
model is given by Moran (1962, p. 7ff). 
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This section brings out the circumstances under which the simple 
equations of Section l are applicable. They clearly apply when the generations 
are discrete and nonoverlapping. But we see also that if the population has 
had a given schedule of birth and death rates for a time long enough to reach 
age-distribution stability, then the equations of Section 1 are appropriate. 
Many populations in nature change in total size at a rate that is slow relative 
to the life duration; under these circumstances there would be an approximate 
age-distribution stability. 

In summary, a population with a fixed schedule of age-specific birth 
and death rates eventually attains a state when the numbers of individuals in 
the various age groups are in a fixed ratio. The ratio of numbers in the differ- 
ent age groups at this stage is given by the /'s of equation 1.3.8. At this stage 
the population is changing in total size at a rate determined by 4,, the largest 
eigenvalue. When 4, is greater than l the population is increasing; when it is 
less than 1, the population is decreasing; when A, = 1, the population is 
stable in size. 

The hypothetical population in Table 1.3.1 and Figure 1.3.1 has a value 
of 4, = 1.00449, which corresponds 
0.45 % per year. 

Table 1.3.2 gives the transition matrix for the female population of the 
United States in 1964, from Keyfitz and Murphy (1967). The survival and 


Table 1.3.2. The transition matrix for the female population in the United States 
for birth and death rates of 1964 (Keyfitz and Murphy, 1967). The values are for 
5-year periods. The principal diagonal gives the probability of surviving from one 
period to the next and the first row gives the number of female births per woman 
during this age period. 


AGE 
0-4 5-9 10-14 15-19 20-24 25-29 30-34 35-39 40-44 45-49 


.0000 .0010 .0878 .3487 .4761 .3377 .1833 .0761 .0174 .0010 


9966 0 0 0 0 0 0 0 0 0 
0 .9983 0 0 0 0 0 0 0 0 
0 0  .9979 0 0 0 0 0 0 0 
0 0 O0  .9968 0 0 0 0 0 0 
0 0 0 0 9961 0 0 0 0 0 
0 0 0 0 0 9947 0 0 0 0 
0 0 0 0 0 0 9923 0 0 0 
0 0 0 0 0 0 0 .9887 0 0 
0 0 0 0 0 0 0 0 .9831 0 
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fertility values are for 5-year periods. The dominant root, 4,, is 1.082. This 
tells us that when a population with this structure of age-specific birth and 
death rates reaches a stable age distribution it will be increasing at a rate of 
8.2%. This is the rate per 5-year time unit, so the annual rate of increase 
would be about one-fifth of this, or 1.6%. 

An ordinary mortality cr “life” table gives the proportion remaining 
alive at various ages with a given schedule of births and deaths. From 
equations 1.3.8 we know that the table gives the relative numbers in each 
age group in an age-stable population that is neither increasing nor decreasing 
(A, = 1.000). A growing population will have more in the younger age groups; 
a decreasing population will have more in the older age groups. 


1.4 Model 4: Overlapping Generations, 
Continuous Change 


The methods of the previous section are specifically adapted to an organism, 
such as many annually reproducing plants, where reproduction is seasonal 
but where the typical individual may live through several such seasons. In 
many other organisms, ranging from short-lived insects to man, births as 
well as deaths occur at all times. If the population is large the discreteness 
introduced by individual births and deaths is lost in the large total and the 
process of population change can be regarded as essentially continuous. 
Mathematically we think of the process as the limit of the previous method as 
the age intervals get smaller and smaller. 

The equations for dealing with such a population were developed by 
Lotka (1925, 1956) and Fisher (1930, 1958). Our procedure follows Fisher 
(1958, pp. 25-27). To be concrete, we shall think of the human population 
where time is measured in years. 

We let /(x) be the probability of survival from birth to age x. (We do not 
intend to imply that x can take only integral values; at some instant a person 
may be 27.4658 years old.) Let the probability of reproducing during the 
infinitesimal age interval from x to x + dx be b(x) dx. Then the probability 
of living to age x and reproducing during the next time interval dx is K(x)5(x) dx. 
The expected number of offspring per individual for his whole lifetime is 
this quantity summed over all ages, or 


MELE dx 1.4.1 
0 


where A is the highest age at which reproduction is possible. For convenience 
mathematically we replace A with œ, which doesn’t change anything since 
b(x) is O for all ages beyond A. If the integral is greater than | the population 
will eventually increase, although there may be temporary decreases because 
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of changes in the age distribution; if the quantity is less than | the population 
will eventually decrease. 

In the previous section we saw that a population with a constant set of 
age-specific birth and death rates attains a stable age distribution. In this state 
the population will increase or decrease at a constant rate. To correspond to 
A or w in the discrete case we use e" in the continuous case, where m is the 
Malthusian parameter (see Section 1.2). We now wish to see how m can be 
determined from the birth and death rates. 

Of the population now alive (at time z, say), those of age x were born x 
years ago at time ¢ — x. Let the number of births occurring in the interval 
dt now be B(t)dt. B(t) is the instantaneous birth rate at time t; if this rate 
continued for a year, there would be B(r) births. The birth rate x years ago 
was B(t — x). Of those born at that time a fraction /(x) will still be alive and 
of these a fraction b(x)dx will give birth during the interval dx. Thus the 
current birth rate of persons of age x will be B(t — x)/(x)b(x), and this, summed 
over all ages, is the total birth rate at time z. This gives us 


B(t) f Bu — x)I(x)b(x) dx. 1.4.2 


If the population has achieved age-distribution stability, its size (and 
therefore its birth rate) is increasing at a rate 77. |n x years the birth rate will 
have increased by a factor e"*. Therefore, the rate x years ago was fraction 
e "* of the current rate. That is to say, 


B(t — x)= B(t)e ™. 


Substituting this into 1.4.2 gives, after cancelling B(ft) on both sides of the 
equation, 


re | e^"*l(x)b(x) dx. 14.3 
0 


This provides a means for calculating m if I(x) and b(x) are known for all 
ages. 

This equation is analogous to 1.3.5. Perhaps the analogy is more appar- 
ent if we divide by 2? and rewrite 1.3.5 as 


l = A Ibo + 27? Pobi + À pop b; dee 1.4.4 


Just as 1.4.4 has only one real root, so does 1.4.3. In these equations A^ * 
corresponds toe "*, (x) to po pipa: p, 1, and b(x) to b,. In actual practice, 
m is usually computed from discrete data, since birth and death rates are 
usually given at discrete (e.g., yearly) intervals. 
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When a population has reached age-distribution stability the formulae 
of Section 1.2 are applicable, in particular 1.2.2 and 1.2.3, so 


— — mN and N, = Noe™. 1.4.5 

dt 

The Malthusian parameters corresponding to the birth and death rates 
in the United States in recent years are given in Table 1.4.1. As mentioned 


Table 1.4.1. The Malthusian parameters of the United States population corre- 
sponding to birth and death rates in particular years. This is the rate at which the 
population would increase if the population came to age-distribution equilibrium 
with these rates of births and deaths at each age. Source is U.S. Census data. 


DATE MALTHUSIAN PARAMETER 
1940 0.0010 
1945 0.0045 
1950 0.0137 
1955 0.0198 
1960 0.0208 
1961 0.0205 
1962 0.0188 
1963 0.0171 
1964 0.0156 
1965 0.0121 
1966 0.0097 
1967 0.0074 


earlier, these give the rate at which the population would increase if it were 
in equilibrium for the age distribution. The value 0.016 obtained earlier for 
1964 data is seen to be in close agreement despite being based on analysis 
of 5-year units. The changes in the Malthusian parameter as a result of chang- 
ing death and, more importantly, birth patterns in recent years is striking in 
the data. At the 1940 rates the population would have been nearly stable in 
size. Then there were substantial increases in fertility rates, followed by a 
decline in recent years. 

The way in which mortality acts at various ages in an insect species is 
illustrated by the data in Table 1.4.2. These come from an exceptionally 
thorough study by Varley (1947), also discussed by Haldane (1949). The total 
mortality in these gall flies is about 99 77, compensated of course by a corre- 
spondingly high fertility. The table shows the comparison between the death 
rate and the Malthusian parameter m, given by the negative of the log-survival. 
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Table 1.4.2. An illustration of the death rates at different ages in the life cycle of a 
gall insect, Urophora jaceana. The data are given as the death rate and as the 
Malthusian parameter, m = log. (survival rate). Data from Varley (1947). 


MONTH DENSITY/METER? CAUSE OF DEATH DEATH RATE m 
July 203.0 
Infertile eggs .090 — .094 
184.7 
Failure to form gall .201 — .224 
147.6 
Died in gall .020 —.021 
144.6 
Eurytoma curta 455 —.607 
August 78.8 
Parasites .365 —.455 
50.0 
Probably mice .616 —.957 
Winter 19.2 
Mice .635 — 1.009 
7.0 
Unknown .257 —.297 
5.2 
Birds and parasites .308 —.368 
May 3.6 
Floods .436 —.573 
July 2.03 
Total .990 — 4.605 


Note that the Malthusian parameters are additive while the survivals are 
multiplicative. The total Malthusian parameter, — 4.605, is log, 0.01 as it 
should be. 


1.5 Fisher’s Measure of Reproductive Value 


Fisher (1958, p. 27) also asks a genetically relevant question: To what extent 
does an individual of age x contribute to the ancestry of future generations? 
In order to answer this, he defines the quantity v(x), the reproductive value 
at age x. (We have also used the letter V to stand for the variance but we shall 
make clear from the context which meaning is intended). 
Obviously the reproductive value is 0 for an individual who is past the 
reproductive age. It clearly is lower at birth than a few years later, since a 
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person of age 10, say, has a better chance of surviving to reproduce than has 
a child at birth. Furthermore, the reproduction would begin sooner and, in a 
growing population, this would increase the total contribution to future years 
by an earlier start. The value might be expected to be maximum somewhere 
near the beginning of the reproductive period. 

To define the reproductive value we note first that the number of births 
from parents of age x is proportional to e^ "*(x)b(x) dx, as shown in the pre- 
vious section. If we now think of this cohort of persons followed through 
the rest of their lifetimes, their total contribution is proportional to 


oo 

| e ""Ky)b(y) dy. 
x 

The exponential term serves to diminish the value of children born a long 
time in the future. This is analogous to the situation where the present value 
of a loan or investment is greater if it is to be paid soon rather than later. 
This is reversed, of course, if m is negative (as would be the present value of a 
loan or investment if interest rates were negative or the investment were 
decreasing in value). The reproductive value is proportional to the total 
contribution per individual of this age, so we divide the contribution by the 
number of persons of that age. This leads to the definition of reproductive 
value at age x, 


oo 
f e "10)60) dy 
I : 1.5.1 

° e ""l(x) 

If x = 0, the denominator is equal to 1. Likewise, if x = 0, the numerator 
is equal to | from equation 1.4.3. Therefore, the reproductive value at birth 
is 1 , and x(x) is a measure of the reproductive value of an individual of age x 
relative to that of a newborn child. 

In developing 1.5.1 we discussed the situation as if the population were in 
age-distribution equilibrium. On the other hand, we can accept the definition 
as given and apply it to populations in general. From this a remarkable 
property emerges: Irrespective of the age distribution, the total reproductive 
value of a population increases at a rate given by m. 

This can be shown as follows. First we rewrite 1.5.1 as 


e7"*lo)(9) = | e""()bQ) d». 


Differentiating both sides with respect to x leads to 


p Ee ae) + I(x) 2o — sooo = —e ™](x)b(x). 
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Cancelling e^ "* and dividing both sides by v(x)/(x) gives 
1 dl(x) | d(x) |. b(x) 


— —— 4 — m= — . 

I(x) dx v(x) dx v(x) 

The leftmost term, with sign changed, is simply the death rate d(x), for it is 
the rate of decrease in the number of age x expressed as a fraction of the 
proportion alive at that age. Making this substitution and rearranging, we 
obtain 


dv(x) 
dx 


The first term is the rateof change in the reproductive value of anindivid- 
ual as his age increases. The second is the rate of decrease in reproductive 
value per individual caused by deaths of individuals of age x. The third is the 
rate of increase in reproductive value from new births; this is simply the 
instantaneous birth rate, since the value of each newborn child, v(0), is equal 
to 1. The left side of the equation, then, is the net change in reproductive 
value of the population contributed by an individual of age x, either by 
growing older, by dying, or by giving birth. The n(x) individuals of age x then 
contribute mn(x)v(x) to the increase in reproductive value of the population. 

Thus the rate of change in reproductive value for each age group is given 
by m. Adding up all ages, we have 








— v(x) d(x) + b(x) = m(x). 1.5.2 


dv 
dt = Mb 
and 
v, = Vo e™, 1.5.3 


where v is the total reproductive value of the population. 

This demonstrates Fisher’s principle: The rate of increase in total 
reproductive value is equal to the Malthusian parameter times the total 
reproductive value, regardless of the age distribution. This means that the 
equations of Section 2 become applicable for populations not at age equilib- 
rium if each individual in the population is weighted by the reproductive 
value appropriate to his age. 


1.6 Regulation of Population Number 


We have said nothing so far about population regulation. It is obvious that 
a population cannot grow exponentially forever. It must eventually reach a 
state where m becomes 0 or negative, or where in a discrete model w becomes 
l or less. The growth rate is eventually limited by all the factors that collec- 
tively make up the carrying capacity of the environment. 
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In population genetics we are mainly concerned with the changes in 
proportions of different types of individuals, rather than total numbers. 
We shall consider some examples of this under various types of population 
regulation. However, we shall ignore until later in the book the complications 
introduced by Mendelian inheritance. 

We shall be dealing in this section with continuous models of the type 
introduced in Section 1.2. We are assuming the kind of model described in 
that section, or if the population has a more complicated structure we assume 
that it has reached age-distribution stability. Alternatively, in principle we 
could deal with reproductive values rather than actual numbers by weighting 
each individual by the reproductive value appropriate to its age. 

Equation 1.2.2 or 1.4.5 can be modified to take regulation into account 
by writing 


dN 
ET rN[I — f(N)]. 1.6.1 


Thequantity r is the intrinsic rate of increase—the rate at which the population 
would grow if it had unlimited food supply and room for expansion. The 
function f/(N) implies some change in the rate of increase with the size of the 
population. The regulation may be, for example, by limitation of food 
supply, by the space available, by the accumulation of toxic products, or by 
territorial behavior patterns. 

A particularly simple model is provided by letting f/(N) bea linear function 
of N,say N/K, where K is a constant sometimes called the carrying capacity 
of the environment. Such a population will grow approximately exponentially 
as long as N is much smaller than K, but as N approaches K the rate will 
decrease until size stability is reached at N — K. 

Substituting N/K for f(N) in 1.6.1 and rearranging, we have 


aN  rN(K — N) 


— 1.6.2 
dt K 
The equation may be rewritten as 
dN dN 
— + ————- rr dt, 
N K-N 
which is readily integrated to give 
t= l In NR =No) No) 1.6.3 
r (K = N, )No 


Here and throughout the book In means log, . 
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For example, if the intrinsic rate of increase of a population is 1% per 
year (r = .01) and the carrying capacity, K, is 5000, the time, f, required to 
change the number from No = 1000 to N, = 2000 is 

1 2000 x 4000 


t = 91 ^ 3000 x 1000 


If there were no regulation, the time required (see 1.2.3) would be 


— 98 years. 


T In — = 69 years. 
r No 

Notice that, whether there is regulation or not, the time required for a 
certain change is proportional to [/r. 

We can also write 1.6.3 in the inverse form, giving the number at time / 
as a function of ¢ and the initial number No. This is 


K 
N,————, 6.4 
© 14 0$,€7" : 
where 
-N 
Co = K 9 E 
No 


This function is shown graphically in Figure 1.6.1. 
The curve is often called the “logistic”? curve of population increase 
and has been widely used in ecology (Pearl, 1940; Nair, 1954; Slobodkin, 


K 


No 


rt 


Figure 1.6.1. The logistic curve of population increase. The 
ordinate is the population number; the abscissa is rf, the product 
of the intrinsic rate of increase and the time. No, the initial 
population, is taken as 1/20 the value of K, the final number. 
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1962). Of course it is merely the simplest of a number of equations that 
could be derived and many populations, natural and experimental, depart 
widely from the model. 

As stated earlier, we are mainly concerned in population genetics with 
the proportion of different genes and genotypes rather than the total number. 
We shall see that many of the equations for the proportions of different types 
are the same, despite quite different mechanisms for regulation of the popula- 
tion number. 

The intrinsic rate of increase, r, is closely related to the Malthusian 
parameter m. We shall use the latter for the actual rate of change in numbers 
of the population, or of a part of the population, and r for the value this 
would take in a situation where the growth rate is not regulated. 

In Chapter 5 we shall consider the effects of natural and artificial selection 
on the composition of a population. Now we are considering a simplified 
system in which the complications of Mendelian genetics are omitted. The 
essential points are brought out more clearly this way. Furthermore, the 
general principles that we are trying to show can be illustrated with only two 
classes; the extension to an arbitrary number presents no difficulties. So we 
are treating two competing species or asexual clones. Actually the same 
equations can be adapted to a single locus in a haploid population, to a 
cytoplasmic particle, to a gene on the Y chromosome or to competition 
between two self-fertilized strains. 

The methods in the following sections come largely from Egbert Leigh. 


1. Unregulated Growth Consider two strains, ] and 2, with numbers n, 
and n, and intrinsic growth rates r, and r;. These are thesame as the Malthu- 
sian parameters when there is no restriction on continuous exponential 
growth. Let N be the total population number; i.e., N = n + m. We shall 
designate by p, = n,/N and p; = l — p, = n/N the proportions of the two 
strains. If there is no regulation of the growth of either strain the rates of 
increase are 


dn, 

uo cun 
and 

dn; 

usnm 1.6.5 
The rate of increase of the total population is 

dN E 

ui — run +ran =FN, 1.6.6 


where r is the mean of the r's, weighted by the numbers in each population. 
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To obtain the rate of change in the proportions of thetwo types, we write 
d In(p,/P2) = d In(ni/n;) 
dt a dt 


_dinn, dinn, 


dt dt 1.6.7 
_ dn, dn, 
— n, dt ~ n, dt 














Notice also that 


d\n(p,;/p2) dlinp, din(l— py) 


dt dt dt 
OPI a Sd Pis... 
pidt | (1— py) dt 
THREE NE 
p(l — pi) dt 
Putting these two equations together gives 
dp 
ea UM - r2)p,\(1 — pi). 1.6.8 


Notice that if we let p, = N/K andr, — r; = r we obtain equation 1.6.2. 
So, 1.6.8. is the equation of a logistic curve. Despite the fact that both strains 
are growing exponentially, the proportion of one type (the faster growing 
one) is increasing according to the logistic equation. 

Equation 1.6.8 can be written in another form by noting that r — 
Pir; pora. Substituting for r, in 1.6.8 gives 

zh = p,(r, — r). 1.6.8a 
This form of the equation suggests the extension to more than two strains. 
When three or more strains are present the same equation is correct for the 
rate of change of a particular strain and r is the weighted average of the rates 
of increase of all the strains. 

In this model the total population is increasing exponentially at any 
instant, although the rate of increase, 7, is changing continuously as the faster 
growing strain replaces the slower. This model is obviously unrealistic for 
any long period of time. We discuss it here to illustrate the point that equa- 
tions of the type 1.6.8. can accurately describe the rate of change of the 
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proportion of one type in a mixed population even when the numbers are 
changing according to quite a different rule. This is also true with various 
forms of regulation as the following examples show. 


2. Logistic Regulation of Total Number A simple model for this 
situation is given by the equations, 


dn - 
qr Qc NIK) 
and 
dn - 
a n(r2 T rN|K). 1.6.9 


The total population increases logistically until it reaches an equilibrium 
at N = K, as can be seen by writing 


dN d(n, + 
p E dt mo) — run, +rana — (nj t njrN[K. 


But, since NV — n, +n, and 
Nr =r,n, TrQn, 


dN 
— - rN(I — N/K). 1.6.10 
dt 
This is the equation for logistic growth (see 1.6.2). 
At the same time that N is changing according to this rule, we can see 
what is happening to the proportions of the two types by writing 


dln(p,[p)) dn, dn, 


dt Sandt ndt 





ri— r5, 


or following the pattern of 1.6.7 and 1.6.8, 


d 
Ti = p,(r,-—r). 1.6.11 


Again, the proportion of strain | is changing logistically. 
Equation 1.6.8 may be written in integrated form as 


In[p/(1 — p)] = C + rt, 1.6.12 


where r — r, — r, and C is In[pj/(1 — po)], a constant determined by the 
initial composition. This suggests a convenient way of plotting data from 
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selection experiments; by plotting In[p/(1 — p)] against time one can easily 
see whether the trend is linear and thus see if the logistic equation is appro- 
priate. 

Alternatively, if we wish to know how much time is needed to change 
the proportion from po to p,, we can write 1.6.12 as 


l p Pill = Po) 
_ n —À 
r  (1—p)po 
which gives the time as a function of the frequency of the type of interest. Note 


that the time required to accomplish a certain change in proportion is 
strictly proportional to the reciprocal of r. 


t= 1.6.12a 


3. Weaker Population Control The previous example assumed that the 
total population has an absolute upper limit, K. We now considera population 
that is limited, but the limit is proportional to r so that as one type replaces 
the other the total population increases. A simple model is 


dn 
P = ny(r1 — cN) 
and 
dn 
p =n,(r, — cN). 1.6.13 


The total number changes according to 


dN 

a N(r — cN). 1.6.14 
The population reaches a limit when N = r/c; for this value dN/dt = 0. 

The same procedure as before leads to the equation for change in pro- 
portion of type 1, 


dp, = pr, — F). 1.6.15 

dt 

This situation is probably quite unusual in nature. The size of the popu- 
lation is usually determined mainly by factors other than the r's. A replace- 
ment of the original strain by one with a higher r will cause only a slight 
increase in the final population number, if indeed there is any change at all. 

The point of these three examples is to show that, despite great differences 
in the way in which the total population changes, the changes in proportion 
follow the same general rule given by 1.6.15. 
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4. Regulation by Efficiency of Space or Food Utilization If popu- 
lations are regulated by the available space, food, or some other limiting 
factor, the type that wins in the competition may not be the one with the 
higher intrinsic rate of increase, but rather the one that can maintain the 
largest numbers in this environment. A simple model illustrating this possibil- 
ity is given by 


dn 
n =r n,(K,—N)/K, 
and 
d 
an n,(K,—N)/K3. 1.6.16 


One interpretation of K, is that this is the maximum population size that 
strain | can maintain when it is the only species; K, has the same meaning 
for strain 2. Suppose, to bring out the point of interest, that r, = r}, but 
K, * K;; the two strains differ, not in their intrinsic growth rates, but in the 
maximum number that this environment can support. 

The change in total number is given by 


dN n, n; 

anid by eee 6. 

F | K, xl 1.6.17 
where r = r; = r;. The change in the proportion of type 1 is 

dp 

T = Rp,p2 = Rpy(d — py); ka 
where 

K,— 1 
R = Nr| ——— |. 
| K,K; 


Equation 1.6.18 is in the general form of the logistic equation (see 1.6.8), 
but it is not the same since R is not a constant. However, in many cases R 
is changing slowly and an equation of the form of 1.6.8 describes the rate of 
change at any particular time. 

In an uncrowded environment the success of a population is determined 
mainly by its intrinsic rate of increase, r. In a crowded environment the carry- 
ing capacity, K, for the species may be more important. MacArthur and 
Wilson (1967) refer to “r selection" and “ K selection." In an uncrowded 
environment (r selection) types which harvest the most food, even if they are 
wasteful, have the largest rate of increase. On the other hand, in a crowded 
environment (K selection) there is a great value on efficiency of utilization 
rather than simple productivity. 
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Ecologists have considered in more detail equations of these types. 
For many purposes it is more meaningful to measure biomass rather than 
simply count numbers, so that oneisn't in the position of equating one mouse 
with one elephant. But we shall do no more with the subject here and get 
on to problems that are more strictly genetic. 

In Chapter 5 we shall be dealing with the changes in gene and genotype 
frequencies under selection. The equations will be basically similar to 1.6.15, 
complicated by the Mendelian mechanism. The purpose of this section, as 
stated before, is to show that equations in thesame form are widely applicable, 
even though the populations may differ greatly in their states. The total 
population may be growing, or static, or decreasing and the different geno- 
types may differ in intrinsic birth rates, death rates, or their response to the 
environment; yet the equations for changes in proportions may be basically 
similar. For these reasons, population genetics has usually ignored the total 
numbers and concentrated on the proportions. 


1.7 Problems 


1. In a population with discrete generations and with fitness w, how many 
generations are required to double the population number? 

2. How long is required for the population to double with model 2? 

3. A population under model 3 has reached age stability. How long, in 
units of A, will be required for the population to double? What is the 
effective generation length, defined as the unit that will give the same 
answer as problem 1 ? 

4. Suppose you know the age-specific death rates (the probability that an 
individual of age x will die during the next time unit). What is the life 
expectancy, that is, the mean length of life? What is the median length 
of life? 

5. Show that equation 1.6.8a is correct for any number of strains. 

6. What are the median and mean length of life under model 2, expressed 
in terms of the death rate, d? 

7. Show that the time required to change the number from N, to N, in a 
logistically growing population exceeds that in an unregulated population 
with the same intrinsic rate of increase by In[(K — N9)/(K — N))]/r. 

8. Again considering a logistic population with carrying capacity K, what is 
the time required to go from a fraction x to a fraction y of this capacity? 

9. One bacterium which reproduces by fission and follows a logistic growth 
pattern is introduced into each of several ponds. Show that the time 
required to fill a pond to half its capacity is proportional to the log of the 
carrying capacity. 
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e are now ready to consider populations with Mendelian inheritance 
and begin by inquiring into the frequencies of the different geno- 
types that comprise the population. Inthe last chapter we were inter- 
ested in the total number of individuals in the population and in different 
subpopulations. In genetic studies there is usually greater interest in the relative 
numbers of different genotypes, so it is convenient to express the numbers of 
different types as proportions of the total. As long as we use deterministic 
models the total number in the population is not important and we can deal 
as well with proportions. With stochastic models the number in the population 
becomes important in determining the extent of random fluctuations and 
therefore must be taken into consideration. 
Only deterministic models will be considered in this chapter, and for 
the most part we shall be concerned with a model of discrete generations. 
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2.1 Gene Frequency and Genotype Frequency 


The number of possible genotypes in a population greatly exceeds the number 
of genes and soon becomes enormous. A diploid population with only two 
alleles at each of 100 loci would have 3!°° possible genotypes, a number far 
larger than the number of individuals in any population. Therefore we effect 
a great simplification by writing formulae in terms of gene frequencies 
rather than genotype frequencies. This usually entails some loss of information, 
for knowledge of the gene frequencies is not sufficient to specify the genotype 
frequencies; but it is usually possible to do this to a satisfactory approxima- 
tion by introducing other information, such as the mating system and linkage 
relations. 

Furthermore, in a sexually reproducing population the genes are re- 
assorted by the Mendelian shuffle that takes place every generation. The 
effects of such reassortment are largely transitory, being undone as fast 
as they are done. For long-range trends we look to the changes in the fre- 
quencies of the genes themselves. As R. A. Fisher (1953) said: 


The frequencies with which the different genotypes occur define the gene 
ratios characteristic of the population, so that it is often convenient to con- 
sider a natural population not so much as an aggregate of living individuals 
as an aggregate of gene ratios. Such a change of viewpoint is similar to that 
familiar in the theory of gases, where the specification of the population of 
velocities is often more useful than that of a population of particles. 


At the outset we need formulae to specify the relation between gene 
frequencies and genotype frequencies. This will also serve to introduce the 
kind of notation that will be used throughout the book. We start with a single 
locus with only two alleles. 

Consider a population of N diploid individuals, of which 


N,, are of genotype A,A,, 
2N, 2 are of genotype A, A, , and 
N,2 are of genotype A, A;, 


where N,, + 2N;; + N,, = N. It is sometimes convenient to distinguish, 
among 4, A, heterozygotes, those that received A, from the mother and those 
that received it from the father. We can do this by designating the two 
numbers as N,; and N;,. However, in most populations N,, = N;, and it 
is not necessary to make any distinction between them. 

We designate the frequency or proportion of the three genotypes by 
Pii, 2P,, , and P2, as follows: 


AA: Py, =N,,/N, 
A44, : 2P,, = 2N,2/N, 
A,A,: P22 =N2,/N. 
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From these genotype frequencies, we can write the frequencies of alleles 
A, and A; , which are designated p, and p; , as follows: 


N,, +N 
pe Pat P 
2.1.1 
Na, 4N 
R= RP. 


The blood groups provide convenient examples, and Table 2.1.1 shows 
some data on the frequency of MN types in Britain. 


Table 2.1.1. Numbers of persons of blood types M, MN, and N in a sample from 
a British population. Data from Race and Sanger (1962). 


PHENOTYPE M MN N 
GENOTYPE MM MN NN TOTAL 


NUMBER 363 634 282 1,279 
FREQUENCY 0.284 0.496 0.220 1.000 


Pı OF Pu = .284 + 4(.496) = .532 
Di OT Pu =.220 + }(.496) = .468 


Extension of this principle to multiple alleles causes no difficulty, though 
the symbolism becomes a little more abstract. Equations of the type of 2.1.1 
suggest the nature of the extension. Consider a locus with n alleles. As before, 


we designate allele frequencies with small letters and genotype frequencies 
with capital letters. 


Alleles: A, A, A; A; A, 
Frequencies: py P2 Pi Pj Pn 


The subscripts i and j are used to designate any two different alleles. 

With these symbols, and letting P;; stand for the frequency of genotype 
A; A; and 2P; stand for the heterozygous genotype 4; Aj, we obtain 

Pe= Py t Prater t Py tc bP tot Pin 


n 2.1.2 
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This procedure is applicable in any situation where each genotype is 
identifiable. The effects of other loci may be ignored if they do not obscure 
the distinction between the genotypes of the locus under consideration. 

Because of dominance, it is often not possible to distinguish all geno- 
types. For example, in most blood group studies no distinction is made 
between AA and AO persons, both being classified simply as belonging to 
blood group A. Under such circumstances the allele frequency can be measured 
only if there is some knowledge about the way in which the genes are combined 
into genotypes in the population. The simplest assumption, and fortunately 
one that is often very closely approximated in many actual populations, is 
random mating—the subject of the next section. 


2.2 The Hardy-Weinberg Principle 


With random mating, the relation between gene frequency and genotype 
frequency is greatly simplified. By random mating we mean that the matings 
occur without regard to the genotypes in question. In other words, the prob- 
ability of choosing a particular genotype for a mate is equal to the relative 
frequency of that genotype in the population. 

Notice that it is possible for a population to be at the same time in 
random mating proportions for some genes and not for others. For example, 
it is quite reasonable for mating to be random with respect to a blood group 
or serum protein factor but be nonrandom with respect to genes for skin 
color or intelligence. 

In many respects, random mating among the different genotypes in the 
population is equivalent to random combination of the gametes produced 


Male Gametes 





Female Gametes 


Figure 2.2.1. A demonstration that random 
combination of gametes leads to the Hardy- 
Weinberg frequencies of zygotes. The gametes 
are along the margins, with the zygotes being in 
the body of the table. 
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by these individuals. We consider a single locus with two alleles, A, and 43, 
with frequencies p, and p; (p, + p; = |). The frequencies are taken to be the 
same in both sexes. The results of random combination of gametes are shown 
in Figure 2.2.1, where the gamete types are on the margins and the resulting 
zygotes are in the body of the table. 

Hence, the first generation after random mating begins, the proportions 
of the three genotypes 4,4, , 4,42, and 4; 4, will be p?, 2p,p, , and p2. 
Later we shall show that these proportions, gotten by combining gametes at 
random, are in fact equivalent to what is obtained by random mating. 

This principle was first reported for the special case p, =p, =.5 by 
Yule (1902) and Pearson (1904), and for other values by Castle (1903). Later 
it was shown for any value of p, and p, by Hardy (1908) and Weinberg (1908), 
and has since come to be known as the Hardy-Weinberg principle. 

The Hardy- Weinberg principle is thought of by population geneticists 
in two somewhat different ways. It is sometimes spoken of as the Hardy- 
Weinberg equilibrium, with the implication that in the absence of factors 
that change the gene frequencies (mutation, selection, migration, random drift) 
and with random mating the population will immediately arrive at and remain 
in the same proportions. Note also that the population variability remains the 
same from generation to generation. Less inclusively, the principle says simply 
that, given the gene frequencies and assuming random mating, the zygotic 
frequencies can be predicted. In this book we shall usually regard the principle 
in the second way; we shall consider the effects of various influences on the 
gene frequencies and then use the Hardy-Weinberg principle to relate gene 
frequencies and genotype frequencies. 

It is easy to verify that random mating of genotypes leads to the same 
expected zygote frequencies as random combination of gametes. Using the 
same symbols as before, we can enumerate the various mating combinations 
and their relative frequencies as shown in Table 2.2.1. 

From the bottom row in the table, noting from 2.1.1 that p, = P,, + P; 
and p; = P,; + P22, we see that in one generation the three genotypes 
attain the same proportions as given by random combination of gametes. 

The fact that the equilibrium is attained immediately after random 
mating begins, rather than gradually, is of great importance. This means that 
it is not necessary to inquire into the past history of a population; if mating 
was at random during the preceding generation, the principle holds. The only 
exception is the rather unusual case where the gene frequencies are different 
in the two sexes, a topic that we shall return to later. 

Notice that p? --2p,p;- p2 =(p, + p2). Therefore the random- 
mating principle can be stated this way: The array of genotype frequencies 
in the zygotes is given by the square of the array of gene frequencies in the 
gametes. 
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Table 2.2.1. Derivation of the Hardy-Weinberg principle by combining parental 
genotypes in random proportions. 


PROGENY 
FREQUENCY OF 
MATINO THIS MATING 
A141 A14; A242 
AAi XA A; (P11)? Pih 
A, Ay x A142 2(P11)(2P 12) 2P, Pi 2P, Pi 
A, A, X 424; 2(P,1X P22) 2P,,Pi; 
A, Az X Ai Az (2P 12)? Pii 2P}, Pi 
A, A2 x A242 2(2P 2)(P22) 2P32P22 2P 12 P22 
A242 X Ar A2 (P22)? P2, 
Total (Pi, +2Pi2 + P22)? (Pay + P12) 2(P,, + Pr2Pi2 + P22) (Pii + P22)? 
=1 = pi = 2p P2 =p} 


As a numerical example, return to the MN blood group data discussed 
earlier (Table 2.1.1). From these data the frequency of the M allele, py, 
was computed to be .532 and the frequency of the N allele, py, was .468. 
Thus, by the Hardy-Weinberg principle the three genotypes MM, MN, and 
NN would be expected to be in the ratio (.532)?, 2(.532)(.468), and (.468)?. 
As shown in Table 2.2.2 these expectations are in remarkably good agreement 
with the observed numbers. 


Table 2.2.2. Comparison of the observed proportion of MN blood types with the 
proportions expected with random mating. The data are from Table 2.1.1; p, = 
.532 and py = .468. 


GENOTYPE MM MN NN TOTAL 
Expected proportion .283 .498 .219 1.000 
Expected number 362.0 636.9 280.1 1279.0 


Observed number 363 634 282 1279 
x? = 0.029, Prob = .87 


The x? method is described in the appendix. In this instance there is only one 
degree of freedom, rather than the two that might have been expected. This is 
because the gene frequency has been estimated from the data, thereby reducing 
the number of degrees of freedom by one. 
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A second example: The incidence of the recessive disease, phenylketo- 
nuria, which results in mental deficiency is approximately | in 10,000. What 
proportion of the population are heterozygous carriers of this condition? 

We let A stand for the normal allele and a for the recessive allele for 
phenylketonuria, and designate the gene frequencies by p, and p,. The Hardy- 
Weinberg principle shows that p? is .0001, so p, is the square root of this, 
or .O1. Since p, and p, must add up to 1, p, is .99. Therefore, the frequency 
of heterozygotes in the population (2p,p,) is 2(.99)(.01) = .0198. 

This example illustrates the important fact that, when the recessive gene 
is rare, the number of heterozygous carriers of the gene is enormously larger 
than the number of individuals who carry the gene in the homozygous state. 
In this example the carriers outnumber the affected in the ratio of 198 to 1. 

Still considering a recessive factor, we can inquire as to the parentage 
of individuals that are homozygous for a recessive factor. If all genotypes are 
equally viable and fertile, and mating is at random, the matings that could 
produce homozygous recessive progeny are as shown in Table 2.2.3. 


Table 2.2.3. Computation of the relative proportions of different genotypes among 
the parents of homozygous recessives, aa. 


s FREQUENCY OF RELATIVE 
MUS jos HOMOZYGOUS- FREQUENCY OF 
A RECESSIVE RECESSIVE 
MATING 

PROGENY PROGENY 

Aa x Aa Qp4pJ* papa pa 
Aa x aa 2(2 pa Pa)(p2) 2pa Ppa 2PaPa 

aa x aa (p2)? pi Di 
Total pa 1 


Thus, even if persons with phenylketonuria were normally viable and 
fertile and married at random with the rest of the population, the great 
majority of phenylketonuric children would still come from normal but 
heterozygous parents. The ratio of feeble-minded children with normal 
parents, those with one normal parent, and with neither parent normal 
would be 9801 : 198 : 1. 

One might question the validity of the Hardy-Weinberg assumption for a 
condition such as phenylketonuria, where affected persons are likely not to 
have children at all. Nevertheless, the error involved in making this assump- 
tion is very small, for the same reason that has already been emphasized. 
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Homozygous recessives make up such a small fraction of the population that 
no serious error is made by assuming that all classes marry at random, which 
is a reasonable assumption for the homozygous-normal and heterozygous 
classes. 

The Hardy-Weinberg principle is exactly true only in an infinitely large 
population in which mating is completely at random; but it is approximately 
correct for the great majority of genes in most cross-fertilizing species. 

The principal departures are of two types: (1) inbreeding and (2) 
assortative mating. Inbreeding occurs when mates are more closely related 
than if they were chosen at random. The most extreme examples are self- 
fertilizing species, but even in species with obligatory cross-fertilization there 
may be some inbreeding mainly because of the frequent geographical pro- 
pinquity of close relatives. Ways in which inbreeding can be measured and 
its effects taken into account are given in Chapter 3. 

Assortative mating occurs when mates are phenotypically more alike 
than with random mating. For example, tall men tend to marry tall women 
and two people are more likely to marry if they have similar intelligence. 
There are also numerous examples in experimental animals of mating pre- 
ferences that lead to assortative mating in heterogeneous populations. 
Assortative mating is discussed in Chapter 4. 

A population that is in Hardy-Weinberg frequencies when the zygotes 
are formed may show significant departures at adult stages due to differential 
mortality. Usua!ly the younger the stage counted, the closer is the realization 
to random-mating proportions. On the other hand, differential fertility does 
not affect the Hardy-Weinberg relation as long as there is no correlation 
between the fertility of an individual and its choice of mate. 

Despite these numerous discrepancies of detail, the random-mating 
assumption fits very well a great many real situations. The most important 
reason is that the genotype proportions are attained after only a single genera- 
tion. This means that a perturbation for any reason is rectified immediately 
rather than slowly over a number of generations. 

It might be of interest to examine some circumstances that are exceptions 
to the above statement in that the equilibrium frequencies are approached 
gradually rather than suddenly. One of these is with sex-linked genes where 
the gene frequencies are different in the two sexes; this will be discussed later 
in the chapter (Section 2.5). Another is a population in which each individual 
can reproduce both asexually and sexually, as can many plants. 

Let C be the proportion of progeny that are produced clonally or asexu- 
ally and S the proportion sexually by random mating. Let P, be the proportion 
of genotype AA at generation r and p be the frequency of gene A. We assume 
that C is independent of the genotype of the individual. Notice that the geno- 
type frequency P, changes from generation to generation whereas the gene 
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frequency p remains constant. In the following generation the 44 plants 
can be produced in two ways: (1) AA plants, with frequency P, , reproducing 
clonally with probability C and yielding a fraction CP, of the total progeny, 
and (2) plants that are homozygous or heterozygous for allele 4 mating at 
random with probability S and producing Sp^ A44 progeny. Thus, in the next 
generation the frequency of AA genotypes will be 


Pray = CP, + Sp’. 
Rearranging, and recalling that C+ S — 1, 
Pia, — p = CP, + SP — p? 
= CP, — p*(1— S) 2.21 
= C(P, — p°). 
There is the same relation between P, and P,_, as between P,, , and P,. Thus 
P, — p! = CP,- — p?) 
= C?(P,_, — p?) 2.2.2 
= C'(P, — p°) 


where P, is the initial frequency of the genotype 4A. Unless C = 1 (exclusively 
clonal reproduction), the limit of C' as t increases is 0, and P, becomes closer 
and closer to p?. The same procedure can be applied to any other genotype. 
Each genotype frequency approaches the Hardy-Weinberg value, but only 
gradually at a rate determined by C. The smaller the fraction of asexual 
progeny, the more rapidly the random-mating proportions are approached. 

Another situation that leads to a gradual rather than a sudden attainment 
of Hardy-Weinberg proportions is overlapping generations. We can consider 
random mating under population model 2 (continuous random deaths and 
births) by supposing that in any small time interval of length dt, a fraction of the 
population, chosen at random, dies and a new fraction is produced by random 
mating of the existing population. The details of this process are given by 
Moran (1962, pp. 23-25). It is clear that the Hardy-Weinberg proportions 
would be approached asymptotically as the proportion of the population 
derived from mating increases and the old survivors constitute a smaller and 
smaller fraction of the population. 

In the simple Moran model the life expectancy is exponential. Thus the 
final equilibrium would be reached strictly only after an infinite period of 
time. However, most organisms have a finite life expectancy, and random- 
mating proportions would be attained as soon as the last survivor from the 
parent generation had died. 
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2.3 Multiple Alleles 


Additional alleles do not introduce any serious difficulty. The principles of 
the preceding section are easily extended to include any number of alleles. 
For multiple alleles the Hardy-Weinberg principle can be stated this way: 

If p, is the frequency of allele A; and p; is the frequency of some other 
allele 4;, the expected frequencies of the genotypes with random mating are 


Ai Ai: Pu = pẹ, 2.3.1 

A; A; : 2P;; = 2p; pj, (Pi; = Pi). 2.3.2 

As an example consider the blood group alleles O, A, and B. The allele 
frequencies are pg , pa and pg and the blood groups are as shown below. 


BLOOD GROUP GENOTYPES FREQUENCY 
(0) oo pò 
AB AB 2PaPs 
A AA or OA Pat 2poPa 
B BBor OB Pè + 2pops 


The equations 2.3.1 and 2.3.2 enable one to determine the expected 
genotype frequencies when the gene frequencies are given. For the reverse 
problem, to determine the gene frequencies when the genotype frequencies 
are given, the relevant formula is 2.1.2. 

The problem is more difficult when not all genotypes are distinguishable. 
It is necessary to make some assumption about the mating system; usually 
one assumes random mating unless there is evidence to the contrary. Consider, 
for example, a 3-allele system in which A, is dominant to both A, and A, 
and A, is dominant to A,. Such a system is frequently found: for example, 
blood groups A,, A,, and O when B is disregarded; B?°, B?, and B blood 
groups in chickens; agouti, chinchilla and white in rabbits. 

Suppose P is the observed proportion of 4,—, Q the proportion of 
A,—, and R of A, A4. Equating the observed and the expected proportions 
gives 


P = pi t2ppi + 2pyps, 
Q =p + 2p2Ps, 
R=p3, 

which on solving gives 
P3 = JR, 
Pr =J/Q+R-J/R, 
p21- J/ Q* R. 
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When there are more phenotypes than genes there is no unique solution 
to the equations such as those just given. In this case the usual method of 
estimating the gene frequencies from the phenotype frequencies is the maxi- 
mum-likelihood method of Fisher (see A.9). As seen in the appendix, the 
solution in the previous paragraph is in fact the maximum-likelihood solution. 

The problem of classifying and enumerating the different possible 
genotypes with a large number of multiple alleles is a difficult and intriguing 
problem. For a discussion, see Cotterman (1953, 1954), Bennett (1956), Hartl 
and Maruyama (1968), and Cotterman (1969). 


2.4 X-linked Loci 


For alleles on the X chromosome the genotype frequencies in the male are 
the same as the gene frequencies, since the male is haploid for this chromo- 
some. The female frequencies, on the other hand, are the same as for diploid 
autosomal loci, since there are two X chromosomes. (We are discussing 
organisms in which the male is heterogametic; when the female is hetero- 
gametic the following statements should be reversed accordingly.) 

This means that there will usually be a large difference in the frequency 
of an X-linked character in the two sexes. For example, a trait caused by a 
recessive gene, a, with frequency p would have a frequency p in males, but 
p? in females. If the gene were rare, the frequency in males would be much 
greater than in females. A classical example of a human X-linked trait is 
color blindness. One of the earliest and largest studies was by Waaler (1927). 
He found that in a group of 18,121 school children in Oslo, about 8 % of the 
boys were color-blind, but only 0.4% of the girls. The data are in Table 2.4.1. 

If we take the proportion of color-blind boys as p, we can estimate the 
proportion of girls as p?. As can be seen from Table 2.4.1, the observed 


Table 2.4.1. Frequencies of color blindness among school children in Oslo. Data 
from Waaler (1927). 


MALES FEMALES 


OBSERVED 
EXPECTED ON BASIS 


NUMBER PROPORTION 
OF MALE FREQUENCY 


NUMBER PROPORTION 


Color-blind 725 .0801 = p 40 .0044 p? = .0064 
Normal 8324 .9199 =1 — p 9032 .9956 
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number does not agree very well with this expectation. We can use the x? 
test to see if the disagreement is too great to be reasonably attributed to chance. 
To obtain the expected numbers by the maximum-likelihood method, 
we let p be the relative frequency of the allele for color blindness and 4, 
B, C, and D be the observed numbers of color-blind males (725), normal 
males (8324), color-blind females (40) and normal females (9032), respectively. 
Then, if m is the proportion of males and N the total observed number (N = 
A-B-C-D-—-18,21), the probability of the observed results is 


Prob = K(mp)^[m( — PIPL — np? I — mX1 — pp. 
where K is a constant determined by A, B, C, and D. Letting L — log Prob, 
L — (A + B)log m + (C + D)log(1 — m) + (A + 2C)log p 
+ (B + D)log(I — p) + D log(l + p) 


plus a constant. Differentiating with respect to m and p, and equating to 
zero, we obtain: 


ôL A+B C+D_ 











ôm m  l-m 
oL A+2C B+D D ET 
Op p | l+p 
Solving these equations, 
A+B 9049 
m= = —, 
N 18121 


as expected, and 





» —B  J/ B? -- 4(A +20)[A + B+ 2(C  D)] -— 
2[A B 4 2(C + D)] 





The expected numbers in the four classes are Nmp, Nm(1 — p), N(1 — m)p? 
and N(1 — my(1— p°). 

The expected numbers are given along with the observed numbers in 
each of the four classes in Table 2.4.2. The number of degrees of freedom is 
3 — 2 = |, since 2 degrees of freedom were lost by estimating m and p from 
the data. The probability of obtaining a disagreement as great as this or greater 
by chance is about .03, so we look for an explanation. 

A plausible basis for the discrepancy is revealed by a finer analysis. 
There are four kinds of color blindness: (1) protanopia, or red blindness; 
(2) protanomaly, or partial red blindness; (3) deuteranopia, or green blindness; 
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Table 2.4.2. Statistical analysis by the x? method of the data in Table 2.4.1. The 
expected numbers are computed on the assumption of random mating by the method 


of maximum likelihood. 


OBSERVED EXPECTED DEVIATION? 
NUMBER NUMBER PESTO EXPECTED 
Color-blind males 725 698.6 26.4 1.00 
Normal males 8,324 8,350.4 — 26.4 0.08 
Color-blind females 40 54.1 —14.1 3.67 
Normal females 9,032 9,017.9 14.1 0.02 
Total 18,121 18,121.0 4.77 
x? =4.77 P=.03 


and (4) deuteranomaly, or partial green blindness. The numbers of the four 


types are given in Table 2.4.3. 


If we let p,, P2, P3, and p, stand for the frequencies of the four kinds of 
abnormal X chromosomes, these proportions can be estimated directly from 
the proportions in males. From these proportions we can estimate the ex- 
pected proportions in females. The expected proportions in females in Table 
2.4.3 are based on the assumption that the capacities for red and green vision 


Table 2.4.3. A finer resolution of the data on color blindness by taking into account 


different types of color blindness. 


MALES 





NUMBER PROPORTION NUMBER PROP. 


Protanopia 80 .0088 =p, 
Protanomaly 94 .0104=p,. 
Deuteranopia 93 .0103 = ps 
Deuteranomaly 458 .0506=p.4 
Normal 8324 .9199 = ps 
Total 9049 1.0000 





FEMALES 
EXPECTED PROPORTION 
BASED ON MALE 
FREQUENCIES 

0 .0000 pi = .0001 

3 .0003 pi + 2p: Pı = .0003 

1  .0001 pi = ,0001 

36  .0040 pi + 2p3 p. = .0036 
9032 .9956 
9072 1.0000 
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are complementary; that is, the heterozygote for a red deficiency and a green 
deficiency has normal vision. On the other hand, protanomaly and protanopia 
are regarded as noncomplementary, with the heterozygote being protanoma- 
lous. Likewise deuteranomaly acts as if dominant to deuteranopia. There is 
supporting evidence for this hypothesis from other sources; for example, 
there is an instance where a woman with normal vision produced two kinds 
of sons—red deficient and green deficient. 

Notice in Table 2.4.3 that the female frequencies fit the expectation very 
well indeed. Ideally we could test for agreement by a y? test, using maximum- 
likelihood estimates based on the frequencies in both sexes, but the procedure 
hardly seems necessary in this case. 

The close fit of the data to the expectation supports the hypothesis that 
the red and green components are complementary. Notice that this analysis 
does not distinguish between a single locus and two loci. If it is assumed that 
there are two separate loci, the quantities p,, p;, P3, Pa, and p; are inter- 
preted as chromosome frequencies rather than allele frequencies. Evidence 
since these data were collected has shown that there are two loci and recom- 
bination has been reported. 


2.5 Different Initial Gene Frequencies in the Two Sexes 


Usually the gene frequencies are the same in the two sexes, but under some 
unusual circumstances they may be different. One possibility is in animal 
or plant breeding where the original hybrids are made up of males of one strain 
and females of another. Another is in a natural population where most of 
the migrants are of one sex. For example, in human populations there are 
examples where most of the migrants to a newly colonized area were male, 
and many of the marriages were between local women and immigrant men. 


Male Gametes 


Female Gametes 





Figure 2.5.1. Random combination of gametes when 
the allele frequencies are different in the two sexes. 

The single and double asterisks refer to male and female 
frequencies, respectively. 
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For an autosomal locus, the Hardy-Weinberg frequencies are attained, 
but after a delay of one generation. Consider a population in which the fre- 
quencies of the alleles A, and A, are pf and p? in males and p1* and p3* in 
females. If the gametes are combined at random, we obtain the results in 
Figure 2.5.1. 

In the next generation the frequency of gene A, in both sexes is 


Pi = Pipi* + dpipl tdpi'pi 


= (pt + pi*), 


2.5.1 


and in all following generations the genotypes are in the proportions pj, 
2p, p; , and p3. As expected, the gene frequency is the unweighted average of 
what it was originally in the two sexes. 

The demonstration that random mating of the different genotypes gives 
the same results as random combination of the gametes follows the same 
general method as was used in Table 2.2.1. Likewise, extension to multiple 
alleles is straightforward and will not be discussed here. 

With an X-linked locus, starting out with different allele frequencies in 
the two sexes, the situation is quite different. The equilibrium, instead of 
being attained in two generations as for an autosomal locus, is reached only 
gradually. 

Consider a particular X-linked allele, 4, in a multiple-allelic series. Let 
the frequency of this allele in generation ¢ be p; in males and pž* in females. 

Since a male always gets his X chromosome from his mother, (1) the 
allele frequency in males will always be what it was in females a generation 
earlier. Likewise, (2) the frequency in females will be the average of the two 
sexes in the preceding generation, since each sex contributes one X chromo- 
some. Finally, (3) the mean frequency of the gene will be the weighted average 
of that in the two sexes, attaching twice as much weight to the female frequency 
as the male because the female has twice as many X chromosomes. This 
quantity must be a constant, since the mean frequency of the gene does not 
change. These statements may be stated mathematically as follows, where 
t — | designates the previous generation. 


(2) p'* 2 3p, + do 2.52 
(3) p. = Ap! + áp * = 4p7_, + 4p, = B1 P, 


where p is a constant throughout the process. 
From the third equation we have prz. , = 3p — 2p**,. Substituting this 
into the second equation, after some algebraic rearrangement we obtain a 
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relation between the gene frequency in the female sex from generation to 
generation. 


p —p--Ap-p. 2.5.3 


Since the same relationship holds for p**, and p?*,, 


pi* —p-(-3' (t^ — p 
= (—4)°(pt*, - p) 
and, finally, 
p'*—p-(-X'(pi* — P), 2.5.4 
where p%* is the allele frequency in females in the initial generation. Notice 


that these formulae do not depend on equal numbers of males and females 
in the population. 


2 
3 Males 


Females 


Gene Frequency 


wl 





0 1 2 3 4 5 6 
Generation 


Figure 2.5.2. The frequency of an X-chromosomal gene in 
the two sexes. In generation 0 the gene has a frequency of 

1 in males and 0 in females. The weighted average frequency 
in the two sexes remains 4 throughout. 
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Thus the approach to equilibrium is gradual. Since (— 1/2)' approaches 0 
as t becomes large, the gene frequency must approach p as a limit. In each 
successive generation the gene frequency in females is only half as far from 
the final value, p, as it was in the previous generation, but in the opposite 
direction. The population moves toward equilibrium in a zig-zag manner, 
like a series of damped vibrations. 

From equation 2.5.2, it can be seen that the gene frequency in males 
follows that of females but is delayed by one generation. This oscillatory 
approach to equilibrium for an X-linked gene with different initial frequencies 
in the two sexes was first shown by Jennings (1916). It is shown graphically 
in Figure 2.5.2. 


2.6 Two Loci 


In Section 2.3 it was stated that, with random mating, the Hardy-Weinberg 
relation was attained in a single generation, irrespective of the number of 
alleles. But it is possible for each of two loci to be in random-mating frequen- 
cies, yet for them not to be in equilibrium with each other. We shall see that, 
contrary to the situation with a single locus, the equilibrium relation between 
two or more loci is not attained immediately, but gradually over a number of 
generations. We still assume that there is no selection. 

Consider two linked loci, with the recombination frequency between 
them equal to c. Unlinked loci can be regarded as a special case where c is 
1/2. Assume that there are 7 alleles at locus A and mat locus B. 


A alleles: A, A, A; A; A 
Frequency: pi P2 Di Dj Pn 
B alleles: B, B, B, B, B, 
Frequency: 4, q2 qx qi Qm 


Let P,(A; B,) be the frequency of the chromosome A; B, among the 
gametes produced in generation t. We want to inquire into the frequency of 
this chromosome in successive generations. For brevity, we shall designate 
P,(A; B,) simply as P,. 

The gametes produced in generation £ may be of two kinds; they may 
be the product of a recombination, with probability c, or they may not have 
been involved in a recombination at the preceding meiosis, with probability 
l — c. If no recombination has occurred, the probability of the 4; B, chromo- 
someis the same as it was in the preceding generation, which will be designated 
as P, ,. If a crossover has occurred, the A and B alleles in a gamete produced 
by the zth generation will have come from different gametes in the t — Ith 
generation; that is, one from the egg and one from the sperm that produced 
the individual whose gamete we are discussing. Since we are assuming random 
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mating, the egg and sperm are independent, and therefore the A locus and the 
B locus are independent. Thus, in the gamete produced by the tth generation, 
the probability of the A allele being A; and the B allele being B, is simply 
the product of the two gene frequencies, Piq- 

Putting these together, we have 


P, — (1 — e)P,-, + cpiqi, 2.6.1 
and after subtracting p; q, from both sides, we obtain 

P, — piq, = (l — cXP,-, — Pid): 2.6.2 

There is the same relation between P,_, and P,. , as there is between 
P, and P,_,. Therefore we can write 

P, — Pig = (l — c) (P, — Pid) 

- (1 — e}? (P... — Pid)» 

until finally, 

P, — Piq, = (1 — ce) (Po — Piq), 2.6.3 


where Pj is the initial value of the frequency of chromosome 4; B,. 

Thus the frequency of the A; B, chromosome gets closer and closer to 
the value p;q, ; each generation the departure from the final value is reduced 
by a fraction equal to the recombination value. For unlinked loci, the chromo- 
some frequency goes half way to the equilibrium value each generation. 
See Figure 2.6.1. 


P, — D,Q, 





Generation, t 


Figure 2.6.1. The rate of approach to gametic phase or 
linkage equilibrium. The ordinate is the difference between 
the frequency of a gamete type and its equilibrium value, 
P, — Piq. 
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The equilibrium value is approached only gradually, and theoretically 
is never attained. For comparison it is convenient to speak of the time 
required to go halfway to the final value, in the same way that one speaks 
of the halflife or median life of a radioactive element. 

The median equilibrium time is given by solving for t the equation 


(l-e =4, 
which leads to 
eS ae 264 
log(1 — c) 
When c is small, log,(1 — c) is very close to — c. So, using logs to the base e, 
log 1 — —.693, and t — .693/c approximately for small values of c. For exam- 


ple, with independent gene loci (c = 1), the median time is 1 generation by 
equation 2.6.4. When c=.1l, the approximation is satisfactory and t is 
.693/.1, or about 7 generations. When c = .0l,thetimeisabout69 generations; 
when c — .001, about 693 generations. 

We can summarize the main results of this section in two statements: 

1. The approach to equilibrium between coupling and repulsion phases 
of the gametes (usually called gametic phase equilibrium or “linkage” 
equilibrium) is gradual. The rate of approach depends on the rate of recom- 
bination between the two loci, and for independent loci is 50 % per generation. 

2. We can broaden the meaning of the Hardy-Weinberg principle by 
regarding chromosomes or gametes rather than single genes. With random 
mating the frequency of any diploid genotype is given by the appropriate 
term in the expansion of the square of the array of gamete frequencies. These 
frequencies are attained immediately if the gamete frequencies are the same 
in both sexes. 

These principles are illustrated by the following example. 


GENOTYPE IN TERMS OF FINAL EQUILIBRIUM IN 
GAMETE FREQUENCIES TERMS OF GENE FREQUENCIES 

A. BJ A. By Pi piqe 

A. B A, B, 2PaP j 2P Piqqi 

A, Bi/ A, B, 2PuPA 2pip qid: 


Therefore, at equilibrium. the two linkage phases (coupling and repul- 
sion) in the double heterozygote are equally frequent. 

Notice also that the frequency of any composite genotype can be obtained 
by simple multiplication. The frequency of the genotype 4,4;B,B,, for 
example, is (2p; pjY(q;); the frequency of A; A; B, B, is piggy; and A; A; B, B, 
is 2p; p)24,4). 
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Crossing over in the two sexes is not necessarily the same. The formula 
given earlier can easily be modified to take this into consideration. A (success- 
ful) gamete has equal chances of having come from a male or a female, 
since the total contribution of the two sexes is necessarily equal. Hence, 
letting cm and c, stand for the recombination fraction in males and females, 
the formula 2.6.1 becomes 


P, — M(l — e)P,-, + c,pid.] + HTC — Cp) Pres + c; pia] 
= (1 — C)P,-, T CP idk» 


where € = 4(c,, + Cy). 

Therefore, when crossing over differs in males and females it is sufficient 
to replace c in equations 2.6.2 and 2.6.3 by the mean value in the two sexes. 
In organisms like Drosophila and Bombyx, where meiotic crossing over is 
restricted to one sex (female in Drosophila, male in Bombyx), the approach 
to equilibrium is just half as fast as it would be if it occurred at the homo- 
gametic rate in both sexes. 

The results embodied in equations 2.6.2 and 2.6.3 were first given by 
Robbins (1918) who obtained them by considering all possible genotypes. 
The simpler and more elegant procedure used here is due to Malécot (1948). 

The results of this section show that, in a population that has been mating 
at random for many generations, a strong association between traits is not 
due to linkage unless the amount of recombination between the responsible 
loci is very small. Such persisting associations as are found in natural popula- 
tions are probably due to pleiotropy or to recent amalgamation of populations 
with incomplete mixing, that is to say nonrandom mating. 

With selection and epistasis, especially with strong interaction and close 
linkage, there can be permanent gametic phase disequilibrium. We discuss 
this in Chapter 5. 


2.7 More Than Two Loci 


It is possible for each group of two among a set of three loci to be in random 
combination, yet for the whole set not to be random. We should expect, 
however, that with random mating all gametic combinations would eventually 
reach equilibrium proportions. That this is in fact correct has been shown, 
first for three loci by Geiringer (1945), and more generally by Bennett (1954a). 

The reader who trusts his intuition on this point is invited to skip the 
rest of this section. For the skeptic we give the derivation for three loci. 

Let P,(ABC) be the probability that a gamete from generation t carries 
alleles 4;, B;, and C, at the three loci under consideration. Let p;, g;, and 
r, be the frequencies of these three alleles. No restriction is placed on the 
number of alleles at any locus; but we are discussing a particular one from 
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each. For simplicity of writing we drop the subscripts since we refer to only 
one allele at each locus. 

We designate by a the frequency of the recombinational event of the 
type that separates A from B and C, b that of the type that separates B, and 
c that for C. For example, if the gene order is ABC, a is the probability of a 
crossover between the A and B loci, and b is that of a double crossover. 
In the special case of independent loci, a = b = c = 1/4. We let K = 1 —a — b 
— c be the probability of a nonrecombinant gamete. 

We can obtain an ABC gamete in four ways: 

(1) From an ABC chromosome that has not undergone any crossover. 
The probability is K times the frequency of this chromosome in the gametes 
produced by the preceding generation. 

(2) From a recombination that combines B and C from one parent with 
A from the other. The probability of this is ap, P(BC) = apP(BC), where 
P(BC) is the probability of a gamete carrying B; and C, in the previous 
generation irrespective of whether it carries A or not. 

(3) Recombination that combines B; with A; and C,, with probability 
bqP(AC). 

(4) A recombination that combines C with A and B, with probability 
cr P(AB). 

Putting these together, we obtain 

P(ABC) = KP,_,(ABC) + apP,-,(BC) + bqP, .,(AC) + erP,_,(AB). 
We now subtract from the left side of the equation the quantity 

P[P(BC) — qr] + qLP(AC) — pr] + rLP(AB) -- pq] 
and from the right side the equivalent quantity 

Pl — b — c)[P,_,(BC) — ar] + g(1 — a — c)[P,- (AC) — pr] 

+ r(1 — a — 5)[P,-1(AB) — pa]. 
That these two expressions are equivalent can be seen by reference to equation 
2.6.2. After making these subtractions we note that, since K —1—a— b — c, 
l—-b-—-c=K+a, —a—c-K-cb, and 1—a—b=K+c. Making 


these substitutions in the right side, subtracting pgr from both sides, and 
rearranging, we arrive at the following equation: 


[P(ABC) — pqr] — pL P(BC) — gr] — qLP(AC) — pr] — r[LP(AB) — pq] 
= K{[P,-,(ABC) — par] — p[P, (8C) — qr] — q[P. -,(AC) — pr] 
— r[P, .,(AB) — pq]). 2.7.1 
If we represent the left side of 2.7.1 by the function L (ABC), we can write 
L (ABC) = KL,_,(ABC) 
= K'L,(ABC). 2.7.2 
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The intuitive conclusion that linkage equilibrium is approached is 
proven by noting first that, since K < 1, L (ABC) approaches 0. Furthermore, 
from 2.6.1, each of the terms such as P,(AB) — pq approaches 0. Thus P,(ABC) 
must approach pqr. 

The extension to more than three alleles can be done in a similar way. 
Bennett (1954a) has shown how to define a function such as L (ABC) for 
four or more loci which satisfies an equation like 2.7.2. 


2.8 Polyploidy 


In the main this book will be restricted to diploid populations (or, sometimes 
for simplicity, haploids). However, we shall occasionally make brief comments 
about polyploid populations, usually without proof, but with references to 
the pertinent literature. 

Polyploid inheritance is complicated by the large number of essentially 
different modes of gamete formation, determined not only by the frequency of 
recombination between two loci but also by the recombination between 
each of these and its centromere. These modes have been classified in a system- 
atic way by Fisher (1947a). 

The principle of Sections 2.6 and 2.7, that with random mating the 
chromosome frequencies approach the product of the component allele 
frequencies, is also true for polyploids. Bennett (1954) has shown this for 
two loci in tetraploids and hexaploids and has given explicit formulae for 
the frequencies at any time in terms of the initial frequencies and the probabili- 
ties of the different modes of gamete formation. Bennett's methods are rather 
sophisticated and an elementary derivation for tetraploids has been given by 
Crow (1954). 

A peculiarity of polyploid inheritance is that the simple Hardy-Weinberg 
principle is not always true. The single-locus equilibrium is not attained in a 
single generation, as with diploids, but is approached gradually. Furthermore, 
except for a locus that is completely linked to its centromere, the equilibrium 
zygote frequencies are not the products of the gene frequencies. We shall 
illustrate this for a tetraploid, following the methods of Bennett (1968). 

Assume that there are n alleles at this tetraploid locus, with frequencies 
Pis P2» pO Let Pj; , be the frequency of the A;A; (diploid) gamete in 
generation t. Likewise let 2P;; , be the frequency of an A; A; gamete. We shall 
designate the frequency of double reduction by the traditional symbol, a. 
By this, we mean the probability of a homoallelic gamete from a plant of 
genotype ÁÁ, A4 A4 ;itisa function of the amount of recombination between 
the locus and the centromere. 

If there has been double reduction, the gamete is necessarily homo- 
allelic and the probability of its being A; A; is simply the frequency of this 
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gene in the population. In the 1 — a cases where double reduction does not 
occur, two of the four chromosomes are chosen. In 1/3 of the cases, the two 
genes will have come from the same gamete in the previous generation, in 
which case the probability of their both being A; is the same as in the gametes 
of the previous generation. In the other 2/3, the two genes will come from 
different gametes in the previous generation, in which case they are independ- 
ent and the probability of their both being 4; is p?. Putting all this together, 


Pii, = (l — GP, ici + SPP) + opi. 28.1 
Similarly, 
Pj r=(1-— ay Pi, t-i + $DiP;)- 2.8.2 


At equilibrium, P;; ,— Pij, i = P3. Making this substitution in 2.8.2 
leads to 


P,=(1—-S)pip;, f= 3a/(2 + a). 2.8.3 
Likewise, 
P;;=(1 —S)p? + fpi. 2.8.4 


Therefore, at equilibrium the proportion of heteroallelic gametes is 
reduced by a fraction f, and the homozygotes are correspondingly increased. 
We have used the letter f intentionally, because equations like 2.8.3 and 2.8.4 
will occur in Chapter 3 when we consider the reduction in heterozygosity 
caused by inbreeding in diploids, and f corresponds to the coefficient of 
inbreeding. 

When the gene is near the centromere, « approaches 0. When the gene 
is more and more distant from the centromere, « approaches 1/7 as a limit. 
When a = 1/7, f = 1/5; so we can say that the limiting decrease in hetero- 
allelic gametes is 1/5. 

The Hardy-Weinberg principle holds true in the sense that the zygote 
frequencies are given by the products of the appropriate gamete frequencies, 
just as in diploids. Thus the frequency of a zygote of type 4; A; 4; 4; would 
be 2(P;,)(2P;,). 

Finally, by subtracting P;; from both sides of 2.8.2 and simplifying we 
find 


M 1—a« 
Pia fi 3 (Pij,r-1 — Pj) 


y a 


which shows the rate at which the equilibrium is approached. 





2.8.5 
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2.9 Subdivision of a Population: Wahlund's Principle 


In nature or in domestic animals and plants the population is often structured. 
One possibility is that it is divided into subpopulations or isolates, between 
which there is partial or complete isolation. In such strains the gene frequen- 
cies may diverge, either because of different environments that favor different 
genotypes, or simply by chance if the subpopulations are small. We inquire 
into the effect of amalgamation of previously isolated subpopulations. 

The key relationship is expressible in terms of the variance in gene 
frequencies among the subpopulations. The formula was discussed by 
Wahlund (1928) and is often called Wahlund's principle. 

Imagine that there are k subpopulations, completely isolated from 
each other and of sizes n,, n4, ... and n,. Mating within each subpopulation 
is assumed to be at random. Let the frequency of allele A be p,, p;, ... and 
p, in these subpopulations. Then the mean proportion of AA homozygotes 
in the whole population is 


2 2 
nipi + nap? +t’ +n pk -A 
ntn toot l 





2.9.1 


Now suppose that these populations are pooled into a single panmictic 
unit. The average frequency of the A allele is now (as before) p, the weighted 
average of the frequencies in the different populations. Then the proportion 
of AA homozygotes in the pooled population after one generation of random 
mating is p?. 

Recall that the variance is V, =p? — p^ (see A.2.2 and A.2.4). Hence 


p =p — V,, 2.9.2 


where V, is the variance in the frequency of the gene A among the k sub- 
populations. 

This explains why the proportion of individuals with recessive traits 
is reduced by migration between previously isolated communities. Since the 
variance is always positive, there will always be a decrease unless the gene 
frequency is identical in the subpopulations. The magnitude of the decrease 
will depend on the diversity of frequencies among the populations, as measured 
by the variance. 

The previous discussion has referred to a situation where two or more 
populations are pooled, and then matings occur without regard to the origin 
of the individuals. The situation is somewhat different if the first matings are 
all between individuals from different populations. We shall consider only 
two populations. 

If p, and p; represent the frequency of the A gene in the two populations, 
the proportion of AA homozygotes in the F, hybrids is p,p;. The gene 
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frequency in the F, is the mean of the two parent population frequencies, or 
p(p, + p2)/2. The variance in the two original populations (equally 
weighted) is 


V, = ipi T ip = p = i - pi. 2.9.3 


Note that p? — V, = p, p;, which is the frequency of 44 homozygotes in the 
F, population. 

For comparison, the proportion of 44 homozygotes in the three popula- 
tions is: 


(1) Separate populations p +V, 
7_V 
p? 


(2) F, population 
(3) F, and later 


"Sl! 


2 


"SI 


Hybridization between two populations causes an initial decrease in 
homozygosity, followed by a rise to a point halfway between. This argument 
does not consider linkage, the effect of which is to slow the approach to 
the final value. 


2.10 Random-mating Proportions in a Finite Population 


The Hardy-Weinberg proportions are realized exactly only in an infinite 
population. For one thing, a finite population is subject to chance deviations 
from the expected proportions. There is also a systematic bias because of the 
discreteness of the possible numbers of different genotypes. The bias can 
become important if there are a number of individually very rare alleles. 
For example, one might determine the allele frequencies from a natural 
population and then wish to inquire if these are in random-mating proportions. 
The problem has been considered in detail by Hogben (1946) and Levene 
(1949). 

Consider a population of size N. Since we are considering diploid 
populations, there are 2N genes per locus. Let p; be the proportion of allele 
A, in this population; hence there are 2Np; representatives of the A; gene. 
Then we regard the zygotes as made up by combining these 2N genes at 
random in pairs. The probability of drawing an 4; allele is 2Np,/2N; after this 
is done, the probability of drawing another A; allele from the remaining genes 
is (2Np; — 1)/(2N — 1). Thus the expected proportion of A; A; individuals, 
given that there are exactly 2Np, A; alleles, is 


P(A; Aj) = SN ONT C pi — p(1— pM. 2.10.1 
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where f= 1/(2N — 1). Likewise the expected proportion of 4,4; hetero- 
zygotes is 


= 22. LII p. L app i 10. 
P(A)= NNT tN 2N-1 PA tf) SS 


Thus the heterozygotes are increased by a fraction f= 1/(2N — 1) and 
the homozygotes are correspondingly decreased, in comparison with the 
proportions in an infinite population with the same allele frequencies. 

As a simple example, consider a population with only two alleles at 
the A locus, A, and A,. Assume further that the A, allele is represented only 
once. Thus p, = 1/2N and p, = l — p,. Substituting these values into 2.10.1 
we obtain 0 for the frequency of A,A,, as we should; for if there is only one 
A, gene there can be no homozygotes. Furthermore, substituting into 2.10.2 
leads to a frequency of A,A, heterozygotes of 1/N; this is also correct, since 
only one heterozygote exists in the population of N individuals. 

In Chapter 3 we shall see that within a finite population there is an 
opposite effect, a decrease in heterozygosity. However, this decrease is 
strictly due to changes in the gene frequencies due to random gains and losses 
in a small population. Within the population the relation between the gene 
and genotype frequencies is given by the Hardy-Weinberg principle, with the 
slight correction given here. 


2.11 Problems 


In all problems, unless the contrary is stated, assume random mating. 

]. In a population there are 8 times as many heterozygotes as homozygous 
recessives. What is the frequency of the recessive gene? 

2. Show that, for a very rare recessive gene, the proportion of heterozygous 
carriers is approximately twice the frequency of the recessive gene. 

3. If 16% of the population are Rh— (dd), what fraction of the Rh+ 
population (DD and Dd) are homozygous? 

4. From the data in problem 3, what fraction of the children from a large 
group of families where both parents were Rh+ would be expected to 
be Rh ? 

5. Show that if the A and B antigens of the ABO blood group system were 
caused by two dominant genes, independently inherited, the product of the 
frequency of A and B should equal the product of O and AB. 

6. What is the maximum proportion of heterozygotes with two alleles? 
With three alleles? With n alleles? (See A.10) 

7. From the data given on color blindness, what fraction of women would 
be of normal vision, but carrying two different color-blind factors? 
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. Show that in a randomly-mating population with two alleles half the 


heterozygotes have heterozygous mothers. 


. Here are some hypothetical data on the frequencies of the ABO blood 


groups: 
Genotype: OO OA AA OB BB AB 
Frequency: .40 .30 .08 .12 .04 .06 


What would the frequencies be next generation if mating were at random? 


. Letting p, q, and r stand for the frequencies of the 4, B, and O blood 


group alleles, what is the probability that two persons chosen at random 
have the same blood group? 


. An outrageously careless hospital gives transfusions at random. What 


proportion would be mismatches? (To refresh your memory, group O 
can give to anyone, A to A or AB, B to B or AB, and AB only to AB.) 


. Show that if p is the frequency of a recessive allele, the average propor- 


tion of recessive children when one parent is of the dominant phenotype 
and the other of the recessive phenotype is p/(1 + p) and when both 
parents are of the dominant phenotype is [p/(1 + p)]?. 


. The two ratios, p/(1 + p) and [p/(1 + p)]^, are sometimes called Snyder's 


ratios and the fact that one is the square of the other is sometimes used 
as a test for recessive inheritance. Does this discriminate between a 
trait caused by a single pair of recessive genes and one caused by simul- 
taneous homozygosity for several recessive genes? 


. Does the answer to problem 13 depend on whether the genes are in- 


dependent or not? Must they be in gamtiec phase equilibrium ? 


. A. G. Searle (J. Genet. 56: 1-17, 1959) reports the following frequencies 


of coat colors of cats in Singapore. The observed numbers were as 
follows: 


FEMALES MALES 
DARK CALICO YELLOW DARK YELLOW 
+/+ +/y yly + y 
63 55 12 74 38 


Use the maximum-likelihood method to compute the gene frequency and 
test by Chi-square the agreement with the hypothesis of random-mating 
proportions. 


. Give an example of a set of gamete frequencies for three loci such that 


any two are in linkage equilibrium, but the set of three are not in equi- 
librium. 
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. Genes A and B are linked with 20% recombination between them. An 


initial population is composed of AB/AB, AB/ab, and ab/ab plants in 

the ratio of 1:2: 1l. The population is allowed to pollinate at random. 

a. What would be the frequencies of the four kinds of chromosomes in 
the next generation? 

b. What would be the frequency of the AB/aB genotype in the next 
generation ? 

c. What would be the chromosome frequencies when equilibrium is 
reached? 

d. What would be the frequency of the AB/aB genotype at equilibrium? 

e. How many generations would be required for the population to go 
halfway to equilibrium? 


. Two homozygous strains aa bb and AA BB are crossed. The A and B 


loci are on separate chromosomes. Show that these loci are in linkage 
equilibrium in the F; generation. Why doesn't equation 2.6.3 apply? 


. Show that in an autotetraploid the value of « is 1/7 as the locus becomes 


far enough from the centromere to be independent of it. 


If p,, P2, P3, and p, represent the frequencies of alleles A,, A,, 44, and 
A, in a randomly mating tetraploid population that has reached equi- 
librium, and the relevant locus is very far from the centromere, what will 
be the frequency of 4,4,4,4; plants? A,A,A, 4; ? A,A: A44, Aq? 

The equations po — JO, Pa=l- JB+ O, and pg - 1 cA +0, 
where O, A, and B represent the frequencies of these three blood groups, 
are often used to estimate the gene frequencies. Show that these are not 
the solutions to the maximum-likelihood equations. Derive these equations 
from the relations below equation 2.3.2. 

Two plausible hypotheses that explain the much greater incidence of 
early baldness in males than in females are (1) an autosomal dominant 
that is normally expressed only in males and (2) an X-linked recessive. 
If the first hypothesis is correct, and q is the frequency of the gene for 
baldness, what proportion of the sons of bald fathers are expected to be 
bald? What proportion from nonbald fathers? What are the correspond- 
ing expectations on the X-linked recessive hypothesis? 


Harris (Ann. Eugen. 13: 172-181, 1946) found that 13.3% of males in 
a British sample were prematurely bald. He also found that of 100 bald 
men, 56 had bald fathers. Show that this is consistent with the sex- 
limited dominant hypothesis but not the sex-linked recessive. (You may 
want to satisfy yourself that the expected fraction of bald sons when the 
father is bald is the same as the expected fraction of bald fathers when the 
son is bald. It is easier to get data by selecting a group of bald men and 
inquiring about their fathers than it is to wait for their sons to grow up.) 
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24. Show that if a group of previously isolated populations are pooled the 
proportion of heterozygotes for alleles A; and A; is equal to the average 
proportion of heterozygotes before pooling minus twice the covariance 
of the 2-allele frequencies. Show also that when there are only two alleles 
the covariance is minus the variance. 

25. Prove the statements in the legend of Figure 2.11.1. Assume that the base 





Figure 2.11.1. Representation of a 
population as a point in a 2- 
dimensional diagram. P, H, and R 
represent the frequencies of the 
genotypes AA, Aa, and aa; p and q 
are the frequencies of the 4 and a 
alleles. P is given by the distance from 
the vertical axis, R by that from the 
horizontal axis. Af is given by either 
the horizontal or vertical distance to 
the hypotenuse of the triangle. All 
possible populations lie within the 
triangle; populations in Hardy- 
Weinberg ratios lie along the parabola. 


and altitude are each equal to 1. In particular, show that P + H+ R=1, 
and that the perpendicular line from the point to the hypotenuse divides 
it in the ratio p : q. Show also that the equation of the Hardy-Weinberg 
parabola is P? — 2PR + R?—2P - 2R*- 129. [You might find it 
useful to note that, with random-mating proportions, H? = 4PR.] 
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26. Another way of representing a population was used by De Finetti (1926). 


27, 


28. 


Prove that, if the altitude of the triangle is 1, P + H + R = 1. Show also 
that the perpendicular from the point to the base divides it in the ratio 
P : q. See Figure 2.11.2. 








Figure 2.11.2. De Finetti diagram of 
a population in triangular coordinates. 
P, H, and R are given by the 
perpendicular distances to the three 
sides. The vertical line divides the 
base in the ratio of the gene 
frequencies, p, and q. Points in 
Hardy-Weinberg ratios lie along the 
curve. 


Use Wahlund's principle or the definition of the variance to show that 
with 7 alleles the minimum homozygosity with random mating occurs 
when all alleles are equally frequent. 

Assume two segregating loci, each with two alleles. Let Pyg, Pay, Pos, 
and P,, be the frequency of the four chromosomes and c the amount of 
recombination between the two loci. A conventional measure of linkage 
disequilibrium is D = P4g P, — Pas Pag. What is the equilibrium value 
of D? How fast is the equilibrium approached? 


INBREEDING 


be if they had been chosen at random from the population. Related 

individuals have one or more ancestors in common, so the extent of in- 
breeding is related to the amount of ancestry that is shared by the parents of 
the inbred individuals. Alternatively stated, the degree of inbreeding of an 
individual is determined by the proportion of genes that his parents have in 
common. 

An immediate consequence of this sharing of parental genes is that the 
inbred individual will frequently inherit the same gene from each parent. 
Thus inbreeding increases the amount of homozygosity. So one observable 
effect of inbreeding is that recessive genes, previously hidden by heterozy- 
gosity with dominant alleles, will be expressed. Since most such genes are 
harmful in one way or another, inbreeding usually leads to a decrease in 
size, fertility, vigor, yield, and fitness. There are also likely to be loci segre- 
gating in a population where a heterozygote is fitter than either of the two 


i nbreeding occurs when mates are more closely related than they would 
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corresponding homozygotes. In this case, too, inbreeding leads to a decreased 
fitness. 

Another consequence of consanguineous mating within the population 
is greater genetic variability, since similar genes tend to be concentrated in 
the same individuals. Usually, because of the correlation between genotype 
and phenotype, this leads to an increase in phenotypic variability. 

Inbreeding may follow either of two patterns. There may be a certain 
amount of consanguineous mating within a population, with the consequences 
just mentioned. On the other hand, the inbreeding may be such as to break 
the population into subgroups. An extreme example is continued self- 
fertilization in which the population (if it is of constant size and each parent 
contributes equally to the next generation) is divided into a set of subpopula- 
tions of one individual each. Likewise, a pattern of repeated sib mating could 
lead to a series of isolated populations of size 2. As a third example, there may 
be a natural population which is divided into isolated subpopulations, within 
each of which mating is random or nearly so. The effect will be that each 
subpopulation becomes more homozygous, and therefore the whole population 
does. The individual subpopulations become more uniform genetically; but, 
since they become homozygous for different genes, the population as a whole 
becomes more variable. Of course there may be only partial isolation, with 
intermediate consequences. 

A point that at first seems paradoxical is that within a subpopulation 
there is an increase in homozygosity despite the fact that mating within this 
group is random. The reason, as will be discussed in Section 3.11, is that there 
are random changes in the frequencies of the individual alleles and these, 
on the average, lead to a decrease in heterozygosity. As an extreme example, 
self-fertilization can be regarded as random mating (i.e., random combination 
of gametes) within a population of one. The gene frequencies at different, 
previously heterozygous loci change from 1/2 to O or 1. 

Whether inbreeding leads to subdivision or not, it can be measured in 
the same way—by Wright's (1922) coefficient of inbreeding, f, which measures 
the proportion by which the heterozygosity has been decreased. As we shall 
show later, other population properties can also be related to f. However, 
before discussing f, we shall illustrate with two simple examples the effect of 
continued inbreeding. 


3.1 Decrease in Heterozygosity with Inbreeding 


The qualitative effect of continued inbreeding can be seen by examining the 
most extreme form, self-fertilization. In a self-fertilized population the 
progeny of homozygotes are like their parents, whereas the progeny of hetero- 
zygotes are 1/2 heterozygotes and 1/4 each of the two homozygous types. 
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Thus, in each generation the proportion of heterozygous loci is reduced by 
half and the homozygous types are correspondingly increased. This is illus- 
trated in Table 3.1.1. 


Table 3.1.1. The changes in the probabilities of different genotypes with continued 
self-fertilization. D, H, and R stand for the initial proportions of dominant, 
heterozygous, and recessive types. 


FREQUENCY OF GENOTYPE 





GENERATION 
AA Aa aa 
0 D H R 
1 D + HIA H/2 R+H/4 
2 D+3HA/8 H/4 R + 3H/8 
3 D 4- 7H/16 H18 R + 7H/16 
4 D + 15H/32 H/16 R + 15H/32 
Limit D + H/2 0 R + H[2 


If Ho is the initial proportion of heterozygotes, the proportion after t 
generations of self-fertilization is Ħọ/2‘. If the original population were 
panmictic, with AA, Aa, and aa genotypes in the proportions p°, 2pq, and 
q^ (p+q=1), the individual lines eventually become homozygous. The 
probability of being AA is D + H/2 = p! + pq =p(p +q) =p; likewise the 
probability of being aa is q. Thus, the population becomes broken into sepa- 
rate lines, each homozygous for one or the other of the genes in the ratio of 
their original frequencies in the population. Notice one other fact: There has 
been no change in the gene frequency. Inbreeding per se does not change the 
proportions of the various genes, only the way they are combined into homo- 
zygous and heterozygous genotypes. 

With less extreme forms of inbreeding the results are similar, though the 
change in heterozygosity is less rapid. The results for continued brother- 
sister mating are shown in Table 3.1.2. Again there is a decrease in the propor- 
tion of heterozygotes, with the amount deducted being divided equally and 
added on to the two homozygous types. 

These results may be obtained by writing out all the possible matings 
generation after generation, as was done by the early investigators (Fish, 
1914; Jennings, 1916). This and several other systems of recurrent inbreeding 
were worked out by these authors. The papers are now mainly of historical 
interest since more general methods are available. We shall discuss them 
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Table 3.1.2. The decrease in heterozygosity with successive generations of brother- 
sister mating. 





RELATIVE DECREASE IN RATES OF CHANGE IN 
GENERATION HETEROZYGOSITY HETEROZYGOSITY HETEROZYGOSITY 
H, Ho — Hi H, H,- — Hi 
] Ho xd Ao =v Ayes H, -1 
0 I 0 1 0 
l 2/2 0 1 0 
2 3/4 1/4 3/4 —.750 .250 
3 5/8 3/8 5/6 =.833 .167 
4 8/16 8/16 8/10 — .800 .200 
5 13/32 19/32 13/16 = .812 .188 
6 21/64 43/64 21/26 — .808 .192 


Limit 0 1 A = 809 19] 


in Sections 3.4 and 3.8, where the results of this table will appear as a special 
case. 

In this example the heterozygosity follows a simple rule. The numerator 
in successive generations is given by the Fibonacci series in which each term 
is the sum of the two preceding terms, while the denominator doubles each 
generation. The number | in the second row is written as 2/2 to make the 
sequence more obvious. The reduction in heterozygosity, expressed as a 
fraction of the initial heterozygosity, is the same regardless of the initial gene 
frequencies and, as we shall show later, the number of alleles. 

The relative heterozygosity, 77,/7;, has been called by Wright (1951) 
the panmictic index, for which he used the letter P. 1 — P is the inbreeding 
coefficient, for which Wright has used the letter F. (We shall use the lower 
case f in order to reserve F for multiple-locus inbreeding effects.) 

The last two columns give the rate of change in heterozygosity. Notice 
that the ratio H,/H,_, after a few oscillations rapidly approaches a constant 
value. The limiting value of the ratio of heterozygosity to that in the previous 
generation is usually designated by 4 (Fisher, 1949). 


3.2 Wright's Inbreeding Coefficient, f 


Wright’s (1922) original derivation of the inbreeding coefficient, f, was through 
correlation analysis. An alternative approach using only probability rules has 
been developed by Haldane and Moshinsky (1939), Cotterman (1940), and 


INBREEDING 65 


Malécot (1948). They distinguish between two ways in which an individual 
can be homozygous for a given locus. The two homologous genes may be: 
(1) alike in state, that is to say, indistinguishable by any effect they produce 
(or perhaps, when molecular genetics has become sufficiently precise, alike in 
their nucleotide sequence), and (2) identical by descent, in that both are derived 
from the same gene in a common ancestor. 

We follow the notation of Cotterman in designating an individual whose 
two homologous genes are identical by descent as autozygous. If the two alleles 
are of independent origin (as far as known from our pedigree information), 
the individual is a/lozygous. The effect of inbreeding is to increase that part 
of the homozygosity that is due to autozygosity. (Notice that an individual 
can be homozygous without being autozygous, if the two homologous genes 
are alike in state but not identical by descent. Conversely, an autozygous 
individual can be heterozygous for this locus if one of the two alleles has 
mutated since their common origin, although this is negligibly rare if only a 
small number of generations is being considered.) 

The inbreeding coefficient, f, is defined as the probability that the indi- 
vidual is autozygous for the locus in question. Alternatively stated, it is the 
probability that a pair of alleles in the two gametes that unite to form the 
individual are identical by descent. 

An individual with inbreeding coefficient f has a probability f that the 
two genes at a particular locus are identical and a probability 1 — f that they 
are not identical, and therefore independent. If they are independent the 
frequencies of the genotypes will be given by the binomial formula. If they 
are identical, the frequencies of the gene pairs will be simply the frequencies 
of the alleles in the population. Thus, for two alleles, A, and A,, with 
frequencies p, and p; (p, + p; = 1), the genotype frequencies are: 


ALLOZYGOUS AUTOZYGOUS 
Homozygous, 41A:: pl-N) + pf 
Heterozygous, A142: 2pipx(1 — f) oe 
Homozygous, A2 A2: pl—f) + PS xi 
Total 1—f 


Notice that when f is O these formulae reduce to the usual Hardy- 
Weinberg proportions. When f= | the population is completely homozy- 
gous. Thus f ranges from 0 in a randomly mating population to | with complete 
homozygosity. How to compute f from a pedigree will be shown later. 

Multiple alleles introduce no difficulty. The genotype frequencies are a 
natural extension of the results for two alleles. The frequencies are 


Aidi: PRU- f) + pif 3.2.2 
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for homozygous genotypes, and 


AiA; : 2pipl — f) 3.2.3 


for heterozygous genotypes. 


The expected proportion of heterozygous genotypes with inbreeding 
coefficient f, H,, is given by 


o~ Hy 
, 


H 
Hy = ¥ pipl -fy=H(l-f); f= H, 3.2.4 


where Ho is a constant equal to the proportion of heterozygotes expected 
with random mating (f= 0). The summation is over all combinations of 
values of i and j except when these are equal. 

This proves the assertion made earlier that the inbreeding coefficient 
measures the fraction by which the heterozygosity has been reduced. We have 
written the formula as if, when f = 0, the population is in Hardy-Weinberg 
proportions. However, for any measured f (as determined, for example, from 
a pedigree), the heterozygosity, H, is Ho(1 — f), where Hg is whatever the 
heterozygosity would have been in the absence of the observed inbreeding. 
To be concrete, the inbreeding coefficient for the child of a cousin marriage is 
1/16 (as we shall show later); therefore the child of such a marriage is 15/16 
as heterozygous as if his parents had the same relationship as a random pair 
in this population. 

There is a simple relationship between the correlation coefficient, r, 
and the inbreeding coefficient, f. If we assign numerical values to each allele, 
then the inbreeding coefficient, f, is the correlation between these values in 
a pair of uniting gametes. In fact, Wright's original derivation of the inbreeding 
coefficient was through correlation methods. 

The relationship between r and f can be shown in the following way. 
For convenience we assign the value | to allele A, and 0 to allele 45, though 
we would get the same result with any values. The calculations are shown in 
Table 3.2.1. 

Since the sum of the genotype frequencies is equal to !, the weighted 
sum and the mean of any value are the same. For example, the sum (and 
mean) of the egg value, X, is [p3(1 — f) + p2f KO) + lpp: — XH) + 
[pop (1 — (00) + [pA — f£) p, f (0), which after some algebraic simpli- 
fication reduces to p,. The other calculations are given in the table, using the 
standard formula for calculation of r given in A.4.3. 

The calculations in Table 3.2.1. are made by assuming that there are 
only two alleles and letting them have the values 0 and 1. The correlation 
interpretation of f, however, is completely general. Table 3.2.2 gives the same 
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Table 3.2.1. Demonstration of the equivalence of the inbreeding coefficient, f, 
and the coefficient of correlation, rxy, between the genetic values of the uniting 
gametes. 


FREQUENCY OF VALUE OF 
EGG SPERM THIS — x? Y? XY 
COMBINATION EGG SPERM 
X Y 
Az A2 pi —f)tpif 0 0 0 0 0 
A A  ppxd— f) 1 0 1 0 0 
A; A p2pi(1—f) 0 1 0 1 0 
A A píü-f-c4pf 1 1 1 1 1 
Sum or Mean 1 pi pi pn pi pil —f)t*pnf 


X= pipx(l —f)+ pil —f)+pi f= pr. 
Likewise, F = X? = Y? = pi. 


XY — XY pX1— f) pi f— pi 


demonstration without restriction as to number of alleles and letting the 
contribution of the alleles differ. Furthermore, if the genic values are summed 
over k loci the covariance will be 


DON LE 


where p, and a;, are the frequency and value of the ith allele at the kth locus. 
This is f times the variance. Hence f is the expected value of the correlation 
between the genetic values of two uniting gametes, regardless of the number 
of loci and number of alleles under consideration. 

The equivalence of r and f suggests an interpretation of the correlation 
coefficient. If a measurement can be thought of as being the sum of a number 
of elements, then the correlation coefficient is the measure of the fraction 
of these elements that are common to the two measurements, the other ele- 
ments being chosen at random. This interpretation is useful in many branches 
of science. In quantitative genetics the elements can obviously be interpreted 
as cumulatively acting genes. 

The computation of f will be discussed in Section 3.4. 
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Table 3.2.2. Demonstration of the equivalence of the inbreeding coefficient and 
the correlation between the genetic value of the uniting gametes regardless of the 
contribution of the individual genes and the number of alleles. The contribution, 
or value, of allele 4, is assumed to be a;, measured as a deviation from the mean 
value. 


FREQUENCY VALUE OF 
EGG SPERM OF THIS zoo AX yp? XY 
COMBINATION EGG SPERM 
X Y 
A A pd-Nt+pSf a a ai ai ai 
A, A; pipdl — f) a ay ai aj aa; 


Vx = 2, piat, 


since the variance of the egg value is the sum of the squares of the allele values, 
each weighted by its frequency. Vy is the same. 


Covxy = (1 -PLÈ piat + 2 Pipsaas] +f? pia. 
i+ i 


But the quantity in brackets is equal to [È p:ai]? which is equal to 0, because 
the sum of the deviations from the mean is 0. Therefore, 


Covxy =f > piat. 


The correlation coefficient, being the ratio of the covariance to the geometric 
mean of the two variances (which in this case are the same), is f, as was to be 
shown. 


rxy = Covxy/Vx =f. 


3.3 Coefficients of Consanguinity and Relationship 


We have used the inbreeding coefficient of an individual /, fy, to give the 
probability that two homologous genes in that individual are identical by 
descent. Or, as just shown, this is the correlation between the genetic value 
of the two gametes that united to produce the individual. Since inbreeding 
of the progeny depends on the consanguinity of the parents we can use the 
inbreeding coefficient as a measure of this. 

We define the coefficient of consanguinity, fiz, of two individuals 7 and 
J as the probability that two homologous genes drawn at random, one from 
each of the two individuals, will be identical. The answer to this is clearly the 
same as the inbreeding coefficient of a progeny produced by these two indi- 
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viduals. Hence the inbreeding coefficient of an individual is the same as 
the coefficient of consanguinity of its parents (Malécot, 1948). 

There is a bewildering plethora of alternative names for this coefficient. 
Malécot, who introduced the idea, called it the coefficient de parenté. Falconer 
(1960) calls it the coancestry. Kempthorne (1957) translated parenté into 
parentage. Malécot himself has, on at least one occasion, translated it into 
kinship. We shall use either consanguinity or kinship. 

A different measure of relatedness, introduced much earlier and still 
widely used, is Wright's (1922) coefficient of relationship, r,, , defined as: 


_ 2frs 
ry = HE Im 3.3.1 


J+ fo + f) 


For two individuals that are not inbred, the coefficient of relationship is 
exactly twice the coefficient of consanguinity. 

As we shall show later, the coefficient of relationship is the correlation 
between the genic, or genetic, values of the two individuals. If the genes 
act without dominance or epistasis, and there is no effect of the environment 
on the trait being measured, this is the expected correlation. We shall also 
show later the effect of dominance on the correlation between relatives 
(Section 4.3). 


3.4 Computation of f from Pedigrees 


The procedure for computing the inbreeding or consanguinity coefficient 
from a pedigree follows directly from the definition of f. Consider the pedigree 
in Figure 3.4.1. 

In this pedigree individual 7 is inbred because both his parents are 
descended from a single common ancestor, A. All unrelated ancestors, which 
are irrelevant to the inbreeding of J, are omitted from the pedigree. We ask for 
the probability that 7 is autozygous;i.e., that the homologous genescontributed 
to / by gametes b and e are both descended from the same gene in ancestor A. 
We shall use the notation Prob(c — b) to mean the probability that c and b 
carry identical genes for the locus under consideration. 

Prob(c = b) = 1/2, since the gene in b has an equal chance of having 
come from C or from B’s other parent. Likewise, Prob(c = a) = 1/2. The 
probability that a and a' carry identical genes may be obtained as follows: 

Let the two alleles in A be called W and Z. Then there are four equally 
likely possibilities for gametes a and a': (1) W and W, (2) Z and Z, (3) W and 
Z, and (4) Z and W. In the first two cases they are identical, so the probability 
is 1/2 that a and a’ get the same gene from A. However, there is an additional 
possibility if ancestor A is inbred, for in this case the two alleles W and Z 
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may both be descended from some more remote ancestor not shown in the 
figure. The probability that A is autozygous, is, by definition, the inbreeding 
coefficient of A, f,. Altogether, if A is inbred, Prob(a = a) 241 -1f,-— 
4(1 + fa); if A is not inbred Prob(a = a’) = 1/2. 
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Figure 3.4.1. A simple 
pedigree with inbreeding. 
Circles and squares 
denote females and males 
respectively. Eggs and 
sperms are designated by 
small letters. Ancestors 
that do not contribute to 
the inbreeding of Z are 
omitted. 


Continuing around the path BCA DE, Prob(a' = d) = Prob(d = e) = 1/2. 
Summarizing, b and e will carry identical genes only if b, c, a, a’, d, and e do 
so. Therefore, since all these probabilities are independent 


Si =fee=Probib=e)= x 4 x +A x d x 4 
b=c c=a a-a' a =d d=e 


= Q» + fy). 


If A is not inbred (and according to information given in this pedigree 
she is not) the inbreeding coefficient of / is simply (1/2)?. Notice that whether 
B, C, D, and E is inbred is irrelevant, since, for example, the probability 
that c and b are identical is independent of the gene contributed by B's other 
parent. The general rule is that the contribution of a path of relationship 
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through a common ancestor is (1/2)"(1 + £4) where n is the number of indi- 
viduals in the path from one parent to the ancestor and back through the 
other parent. 
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Figure 3.4.2. A more complicated pedigree; f = 3/32. 





In more complicated pedigrees there may be multiple paths through an 
ancestor or more than one common ancestor. Consider the pedigree in Figure 
3.4.2. The contributions to the inbreeding coefficient of J from the various 
paths are as follows. The common ancestor in a path is underlined. 


PATH CONTRIBUTION TO f 
ABDEGH (1/2)® = 1/64 
ACDEGH (1/2)§ = 1/64 
ABDFH (1/2)5 = 1/32 
ACDFH (1/2)? = 1/32 


As we are considering only a single locus, the paths are all mutually 
exclusive; if 7 is autozygous for a pair of genes inherited through one path 
it cannot at the same time be autozygous for a pair inherited through another. 
Therefore the total probability for autozygosity is the sum of the probabilities 
for the separate paths, in this case 3/32. 

This pedigree was not complicated by the common ancestor of any path 
being inbred. Individual A is inbred, but this is irrelevant since A is not a 
common ancestor. Only inbreeding of E or F would matter. This complica- 
tion arises in the pedigree in Figure 3.4.3, where there are several inbred 
individuals in the pedigree and two of these, B and D, are common ancestors 
of one or more paths. 
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Figure 3.4.3. A still more complicated pedigree; 
f = far ES .428. 
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We begin by noting the inbreeding coefficients of D and B. The inbreeding 
coefficient of D, fp, is (1/2); fa = (1/2)°(1 + fp) = 9/64. The components of 
f; through the various paths are: 


AEF: (1/2? = .1250 
ABF: (1/27 (1 +f) = .1426 
ABEF: (1/2)* = .0625 
AEBF: (1/2)* : = 0625 
ABCDEF: (1/2)6(1 - fj) = .0176 
AEDCBF: (1/2)°(1 - fp = .0176 

f= he = 4278 


Notice that the inbreeding of B is taken into consideration in path 
ABF where B is a common ancestor, but ignored in the other paths where B 
is not the common ancestor. A path such as ABCDEBF is not included 
because B enters twice; the contribution to this path is included in path A BF 
by the term (1 + fp). 

To summarize: The inbreeding coefficient of an individual J, or the 
coefficient of consanguinity of his parents, J and K, is the sum of a series 
of terms, one for each path leading from a parent to a common ancestor and 
back through the other parent. The general formula is 


fr — fag = EEY + fad], 3.4.1 


where the summation is over all possible paths, n is the number of individuals 
in the path (counting J and K, but not /) and f, is the inbreeding coefficient 
of the common ancestor at the apex of this path. 

A path cannot pass through the same individual twice. No reversal of 
direction is permitted except at the common ancestor; always go against the 
arrows in going from one parent to the ancestor, and with them coming back 


INBREEDING 73 


through the other. It is helpful in avoiding counting the same path twice to 
adopt the convention of starting all paths with the same parent (the male, 
say) and ending with the other. 

In the earlier literature the procedure given for computing f was to count 
the number of steps between individuals in a path rather than the number of 
individuals. The results are of course the same either way. Formula 3.4.1 
was first given in the present form by Wright (1951). We use it because it 
follows more naturally from our derivation than the earlier form, and because 
it is easily adapted to X-linked genes. 

An X-chromosome gene that is in a gamete produced by a male must be 
the same as was in the egg from which this male came. Therefore the proba- 
bility of identity by descent in these two gametes is 1, rather than 1/2 as it 
would be for a female or an autosomal locus. Hence each male in a path 
multiplies the probability of identity through this path by 1 rather than 1/2, 
and the effect is as if the males were not counted at all. Furthermore, a male 
does not receive an X-chromosome from his father, so a path involving two 
successive males makes no contribution to the probability of identity of 
X-chromosomal loci. 

Therefore the rule for obtaining the inbreeding coefficient for a sex- 
linked locus in females is: Proceed as usual except that only females in a path 
are counted and any path with two successive males is omitted entirely. 

As examples, consider again the three pedigrees in Figure 3.4.1, 3.42, 
and 3.4.3. There is no meaning to the inbreeding coefficient of a male since 
he has only one X chromosome; so the pedigree in 3.4.1 is not of interest. 
In Figure 3.42, f; is 1/32 (path ABDEGH) + 1/8(ABDFH) = 5/32. Paths 
ACDFH and ACDEGH have successive males and are omitted. In the same 
manner, in Figure 3.4.3, fp = 0, fg = 1/4, and f, = 5/8. 


3.5 Phenotypic Effects of Consanguineous Matings 


In Section 2.2 the frequency of the recessive gene causing phenylketonuric 
feeble-mindedness was given as approximately 1/100. Therefore with random 
mating the frequency of persons homozygous for the gene is the square of this, 
or 1/10,000. We now inquire how much this is enhanced with consanguineous 
marriage. 

The probability of an affected child as given by 3.2.1 is p*(1 — f) + pf, 
where p is the frequency of the recessive allele and / is the inbreeding coeffi- 
cient of the child. If the parents are cousins their coefficient of consanguinity 
or the inbreeding coefficient of their child is 1/16, as computed from Figure 
3.5.1. With p = 1/100 and f= 1/16 the expected frequency of homozygous 
recessives is 115/160,000, or approximately 7/10,000, a 7-fold increase 
compared with the risk when the parents are unrelated. 
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Figure 3.5.1. Pedigree 
of an individual whose 
parents are cousins. 
J=} +} =1/16. 


Table 3.5.1 illustrates the way in which the relative frequency of a disease 
in the children of cousin marriages increases with the rarity of the gene. 

We can also ask the question the other way around: What proportion of 
the persons affected with recessive traits come from consanguineous marriages ? 

This proportion, K, may be obtained by dividing the number of affected 
from consanguineous marriages by the total number of affected. 1f c is the 


Table 3.5.1. The proportion of homozygous-recessive individuals in a population 
with random mating and when the parents are first cousins for various frequencies 
of the recessive allele. 


FREQUENCY OF AFFECTED WHEN 
GENE 





FREQUENCY "onn d RONS, 
f=0 f—-1/6 
0.1 0.01 0.016 1.6 
0.01 0.0001 0.00072 7.2 
0.005 0.000025 0.000335 13.4 


0.001 0.000001 0.000063 63 
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proportion of consanguineous marriages in the population of size N, the 
number of affected from consanguineous marriages is Nc[p?(1 — f) + pf], 
where f is the coefficient of consanguinity of the parents and p is the recessive 
gene frequency. The total number of affected in the population is 
N[p?(1 — f) + pf], where f is the mean inbreeding coefficient in the popula- 
tion. Therefore 


y . TPC —f)+pf] cdp-(-pf] 
IDEE ADIT M du SUE Edd 3.5.1 


p — f) pf p-pjtj ’ 
or, since pf is usually very small 


dp (1 — pf] 
p+f 


approximately. 


The most common consanguineous marriage is between first cousins. 
For f = 1/16, 3.5.1 becomes 


_ €(1 + 15p) 
l6[p + f(l — p] 
or approximately 


c(1 4 15p) aa 
lép* J) : 


Some representative values of K are given in Table 3.5.2. 

This shows that, even though consanguineous marriages are very rare, 
a substantial fraction of diseases caused by recessive genes comes from 
such marriages if p is .01 or less. Consanguinity of the parents is one of the 
strongest kinds of evidence of recessive inheritance. 

On the other hand, if the recessive gene is common, the increased inci- 
dence with consanguinity is very slight. Cystic fibrosis of the pancreas appears 
to be due to a simple recessive factor, yet there is no appreciable rise in inci- 
dence from consanguineous marriages, because the allele frequency is so 
high. The parental-consanguinity rate is much higher for recessive traits 
where the gene frequency is low. 

Most recessive genes are carried concealed in the heterozygous condition. 
We can get some idea of the total number of such genes carried by normal 
persons through a study of consanguineous marriages. A study by Jan A. Book 
showed that about 16 % of the children of first cousin marriages in Sweden 
had a genetic disease, and if diseases of more doubtful etiology were included 
the number rose to 28 %. The corresponding figures for the control population 
with unrelated parents were 4% and 6%. Thus cousin marriage, by these 


K 3.5.2 
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Table 3.5.2. The proportion, K, of cases of recessive conditions expected from first- 
cousin marriage for various values of gene frequency (p), frequency of cousin 
marriage (c), and population inbreeding coefficient (f). 


c f P K 
0.005 0.0005 0.1 0.008 
0.005 0.0005 0.03 0.015 
0.005 0.0005 0.01 0.034 
0.005 0.0005 0.003 0.093 
0.005 0.0005 0.001 0.212 
0.005 0.0005 0.0003 0.392 
0.01 0.001 0.1 0.015 
0.01 0.001 0.03 0.029 
0.01 0.001 0.01 0.065 
0.01 0.001 0.003 0.163 
0.01 0.001 0.001 0.317 
0.01 0.001 0.0003 0.483 
0.02 0.002 0.1 0.031 
0.02 0.002 0.03 0.057 
0.02 0.002 0.01 0.120 
0.02 0.002 0.003 0.262 
0.02 0.002 0.001 0.423 
0.02 0.002 0.0001 0.546 


data, entails an increased risk of 12% to 22% of having a child with a detect- 
able genetic defect. Since the child of a cousin marriage has an inbreeding 
coefficient of 1/16, we reason that a completely homozygous individual would 
have 16 times as many diseases, or approximately 2.0 to 3.5. Thisis the number 
of recessive factors per gamete (since a homozygous individual may be re- 
garded as a doubled gamete), so the number per zygote is between 4 and 7. 
These figures are based on rather limited data, but they furnish a rough idea 
of the magnitude. The conclusion is that the average human carries hidden 
the equivalent of some half a dozen deleterious recessive genes that, if made 
homozygous, would cause a detectable disease. 

We can also estimate the amount of genetic weakness that is carried 
hidden in a heterozygous individual, but which would be expressed as invia- 
bility if he were made homozygous. Sutter and Tabah (1958 and earlier) 
found from a demographic study in two rural provinces in France that 
children of cousin marriages died before adulthood about 25 % of the time, 
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whereas the death rate from unrelated parents was about 12%. Thus, in this 
environment, cousin marriage increased the risk of death by about 0.13. 
Making the same calculations as above (i.e., multiplying by 16 x 2) we estimate 
that the average individual in this population carries 32 x .13 or about 4 
hidden *' lethal equivalents." We say “lethal equivalents ” because one cannot 
distinguish between 4 full lethals and 8 genes with 50 % probability of causing 
death, or any system where the product of the number of genes and the average 
effect of each is 4. For a more sophisticated treatment of this subject, making 
use of all degrees of relationship rather than just cousins, see Morton, Crow, 
and Muller (1956). 

The data on human inbreeding effects have not been very reproducible. 
The large body of data from Japan show significant heterogeneity effects from 
city to city. There is danger of confounding inbreeding effects with the effects 
of social concomitants of consanguineous marriages. For all these reasons, 
we cannot place too much reliance on the numerical values of the previous 
paragraph. 

It is also to be expected that what is lethal in one environment may be 
only detrimental in a better one. In much of the world there has been a sub- 
stantial rise in the standard of living and a decrease in the death rate. This 
means that the number of lethal equivalents is decreasing. 

In Drosophila, where the measures are precise and reproducible, there 
are about two lethal equivalents per fly. About 2/3 of the viability depression 
from inbreeding is attributable to monogenic lethals; the rest is the cumulative 
effect of a much larger number of genes with individually small effects. 


3.6 The Effect of Inbreeding on Quantitative Characters 


We consider first a theoretical model that is applicable to any measurable 
trait, such as height, weight, yield, survival, or fertility. For initial simplicity, 
a single locus with only two alleles is assumed. The model is summarized as 
follows: 


GENOTYPE Ai Ai A142 A2A2 
FREQUENCY pi(l—f)tpif 2pipAl—f) ÉU- f) +p S 
PHENOTYPE Y—A Y+D Y+A 


In this model Y is the residual phenotype when the A locus is not con- 
sidered. Genotype A, A, adds an amount A to the phenotype, and 4,4, 
subtracts an equal amount. (We could just as well assume that both genotypes 
add to the residual, or that both subtract; this model is chosen arbitrarily 
and for algebraic simplicity. The same result would be obtained in any case.) 
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If there were no dominance, the phenotype 4, 4; would be Y. Under this 
circumstance, the amount by which the phenotype is changed by substituting 
an A, for an A, is always A, which we can call the additive effect of the A 
locus. D is a measure of dominance. When D = 0, there is no dominance; 
when D = A, A; is completely dominant; when D = — A, A, is completely 
recessive, or 4, dominant; when D > A, there is overdominance, the hetero- 
zygote having a higher phenotypic value than either homozygote. 

We now obtain an expression for the mean phenotype Y. This will 
be given by summing the products of each phenotype and its frequency. 
(Since the frequencies add up to one, the sum is the same as the mean.) 
Therefore, 


Y =(Y—A)[p20) —f) - pf] (Y + D)2p,p2(1 — f) 
+(Y AX — 7) + p2f]. 


which leads after some algebraic rearrangement to 


Y = Y + A(p; — p) + 2pp3 D — 2p pı Df 


3.6.1 
= Hf, 

where G = Y + A(p; — pı) + 2p,p; D and H = 2p, pa D. G is the average 
phenotype with random mating and G — H is the average with complete 
homozygosity. Notice that H is positive if D is positive. 

This equation brings out two important facts about the phenotypic 
consequences of inbreeding. The first is that in the absence of dominance 
(D = 0), there is no mean change with inbreeding. The second fact is that 
the equation is a linear function of f. This means that whatever the level of 
dominance, as measured by D, the change with inbreeding is proportional 
to the inbreeding coefficient. As long as D is positive (i.e., the heterozygote 
has a larger phenotype than the mean of the two homozygotes), inbreeding 
will produce a decline. 

There are two possible causes of the inbreeding decline that is so univer- 
sally observed: (1) Favorable genes tend to be dominant or partially dominant 
(0 « D « A), and (2) the heterozygote has a higher phenotype than either 
homozygote, (0 < A < D). Notice that the observation of a linear decline 
in a quantitative trait cannot discriminate between these possibilities, for 
it would be expected with either type of gene action, or any mixture of the 
two. To discriminate between them will require other kinds of evidence. 

The extension to more than two alleles is straightforward and will not 
be given here. We shall consider the extension to more than one locus. 
Consider a model with two loci. each with two alleles. We shall let 4 stand 
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for the additive effect of the A locus and B for that of the B locus. If there is no 


interaction between loci A and B, the model is as follows: 


GENOTYPE AA: A, A2 A2 A2 
FREQUENCY pi(l—f)+p,f 2pipAl—s) pil—f/)+pf 


BiB, ril—D)-nf Y—A-— B Y+ D,—B Y+A-—B 


BB: 2r,ri(1 — f) Y-—A+D, Y+ Dı + Ds Y+ A+ Ds 


BiB; ri(1 —f)-o rf Y— A+B Y+D,+8 Y+A+8 


If the A and B loci are independent and in gametic phase equilibrium, the 
frequency of any of the nine classes in the table is given by the product of 
the frequencies at the borders. We get the mean phenotype as before, by 
multiplying each phenotype value within the table by its frequency and sum- 
ming the nine products. After simplification this leads to 


Y = Y + A(p2 — ps) + 2p1p2 Da + B(r2 — ri) 


3.6.2 
+ 2rir; Dg — 2(pip2 Da + rir? DBY 


As before, there is a linear relation to the inbreeding coefficient (unless 
D, = Dy = 0), as might be expected from knowledge that this is true for 
either locus by itself. We now complicate the model by assuming an inter- 
action between the two loci. In population genetics the word epistasis is 
given a meaning broader than its classical meaning so as to include all levels 
of nonadditive effects between loci. Any circumstance where a substitution 
at the A locus has a different effect depending on the genotype at the B locus 
is an example of epistasis. A simple way to construct such a model is to add 
interaction terms to each of the values already given so that the phenotypes are 
now: 


AA: A1A2 A2 Az 
BiB, Y—A-—B-«I YTtD,—-B-L Y+A—B-I 
BB; Y—A+Ds—K Y+Dı+Ds+J Y+A+Ds+K 
Bi Bi Y—A-cTB-—I Y+Dı+B+L Y+4+B+I 


Notice that there are nine parameters, Y, A, B, D,, Dg, I, J, K, and L, 
which correspond to the nine phenotypes so there is a complete specification 
of the phenotypes when the parameters are given, and vice versa. 
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The formula for Y may be written as 
Y =G- Hf+ Mf’, 3.6.3 
where 


G = Y + A(p2 — pı) + B(r3 — rı) + 2D4 Pı P2 + 2Dgnryr; 
Fry — r3Xpy — P2) + 2Kryr2(p2 — Pi) + 2Lpypx(ra — ri) 
+ 4Jpipa rsrz, 

M = Appanrir;J, 


and 


H = 2[p\p2 Da + rir Dg + pipy(ra — r)L 
t ryr2(p2 — p)K + Ap parira J]. 


In this model, A and B represent the additive effects of the two loci, and 
D, and Dg the dominance effects, as before. / is the effect of pure epistasis 
without dominance; in other words, the interaction of the additive effect of 
A with the additive effect of B. K is a measure of interaction and dominance; 
it is the effect of the A locus on the dominance of the B locus. Likewise, L is 
the effect of the B locus on the dominance of the A locus. Another way of 
saying this is that this is the interaction of the additive effect of the B locus 
with the dominance effect of the A. Finally, J is the epistatic effect of the two 
dominances; it is the interaction of dominance at the A locus with dominance 
at the B locus. 

From equation 3.6.3 we see that if all the terms involving dominance 
effects—D ,, Dg, J, K, and L—are 0, there is no inbreeding effect since 
the coefficients of f and f? are 0. Thus epistasis alone, without dominance, 
does not produce an inbreeding decline. 

If J = 0 the inbreeding change is linear in f. In order for the coefficient of 
f? to be other than 0, there must be interaction between the two dominance 
effects. 

If A, B, D,, Dg, K, and L are all positive, then the genes with subscript 2 
are associated with increased performance (or yield, or fitness, or whatever is 
being measured). This also generally means that the alleles with subscript 2 
will be more frequent than those with subscript |. Then if J is positive there 
will be diminishing epistasis during inbreeding. By this, we mean that the 
curve is concave upward and that homozygosity for two loci reduces perform- 
ance by less than the sum of the individual effects. Contrariwise if J is 
negative (assuming H > 0), the epistasis is reinforcing. That is, the deleterious 
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effect of two loci is more than cumulative. This is sometimes called synergistic. 
These general types of epistasis are illustrated in Figure 3.6.1. 







Diminishing Epistasis 


Performance 


No Epistasis 


Reinforcing Epistasis 





f 


Figure 3.6.1. Decline in performance (or 
any trait of interest) with inbreeding under 
no epistasis, diminishing epistasis, and 
reinforcing epistasis. 


3.7 Some Examples of Inbreeding Effects 


Table 3.7.1 gives some numerical examples and illustrates the way in which 
specific levels of dominance and epistasis may be constructed by choosing 
appropriate values for the parameters. In these examples the gene frequencies 
are all 1/2. The values are contrived so that fully homozygous individuals 
(f = 1) have an average phenotype of 10. 

Models | and 2 show that, whether or not there is epistasis, there is no 
inbreeding effect without dominance; the phenotype is 10, independent of f. 

Models 3 and 4 show that the rate of decline with inbreeding by itself 
cannot distinguish among different levels of dominance. Model 5 is an extreme 
form of reinforcing epistasis; model 6 is an extreme form of diminishing 
epistasis. Models 6 and 9 show that quite different systems of gene action 
can give the same quadratic equation. 

Models 7 and 8 show that epistasis, though it is a necessary condition 
for a nonlinear inbreeding effect, is not sufficient. In both there is epistasis, 
yet the equations are the same; moreover they are the same as models 3 and 
4, Only when J is not 0 is there a nonlinear effect, as 5, 6, and 9 show. 

The examples in Table 3.7.1 all assume equal frequency for the two 
alleles at each locus. Usually this is not the case. Let us assume that the higher 
phenotypic value is desirable, which means that for most of the models 
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Table 3.7.1. Some numerical illustrations of the phenotypic values under various 
models of dominance and epistasis. Also given is the average phenotype, Y, when 
p—pi—rn-ra-—.S. 


MODEL 


DESCRIPTION 


No dominance, no epistasis 


Y=10,A=B=l 


D,= Dp =1=J=K=L=0 


Epistasis, but no dominance 
Y=10,A=B=2,/=1 
Dı =Ds=J=K=L=0 


Complete dominance, no epistasis 
Y =10, A =B = Dı = De = |1 


I=J=K=L=0 


Overdominance, no epistasis 
Y = 10, A = B =0, Dı = Ds = 1 


I=J=K=L=0 


Complete dominance, 


complementary recessive genes 
Y=10,A=B=D,= Dsg=1 


I=J=K=L=~—_1 


Complete dominance, 


complementary dominant genes 
Y = 10, A = B = Dı = De = 1 


l=J=K=L=1 


Dominance and epistasis 
Y = 10, A = B = Dı = Dg= 1 


1=K=L=1,J=0 


Dominance andepistasis 
Y=10,A=B=D,=D,=I1=1 


J=K=L=0 


Overdominance and epistasis 
Y= 10,4 = B =0, Dı = Ds = 1, 


I=K=L=0,J=1 


PHENOTYPIC 
VALUE 
8 9 10 
9 10 11 
10 11 12 
7 8 9 
8 10 12 
9 12 15 
8 10 10 
10 12 12 
10 12 12 
10 11 10 
11 12 11 
10 11 10 
7 11 H 
11 11 1l 
11 11 11 
9 9 9 
9 13 13 
9 13 13 
9 9 9 
9 12 13 
9 13 13 
9 10 9 
10 12 12 
9 12 13 
10 11 10 
11 13 11 


10 11 10 


AVERAGE 
PHENOTYPE 
Y —10 
Y —10 
f=11 -f 
f=11 -f 


ř=11 -f 


Y-11—f 
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A, and B, will tend to be more frequent than their alleles. Consider as an 
example that p, =r; =.1. Then the equations for models 5 and 6 become: 


Model 5, complementary recessive genes: Y = G — .007f — .032/?, 
Model 6, complementary dominant genes: Y = G —.356f + .032/?. 


Model 5 shows that with a trait depending on simultaneous homo- 
zygosity for two rare recessives, there will be very little inbreeding effect when 
f is small—because the coefficient of f is small, and f? is very small. Only 
when the inbreeding is sufficient for f? to be appreciable will its larger 
coefficient become important. Thus, to the extent that there are detrimental 
traits depending on multiple homozygosity, inbreeding effects will tend to 
be nonlinear, with very little effect of slight inbreeding, but with an accelerating 
effect at very high levels of inbreeding. 

With model 6, on the other hand, the quadratic term never becomes im- 
portant and the linear term dominates for all values of f. Thus for rare genes 
with duplicate effects the inbreeding effect is linear for all practical purposes. 

When the trait considered is survival it is often more natural to measure 
epistasis as deviations from independence rather than from additivity. 
Survival probabilities are multiplicative if the genes act independently. It is 
often advantageous to transform to logs, or to measure fitness in Malthusian 
parameters. 

We summarize this section by stating three cenclusions, all of which are 
apparent from 3.6.3 and are illustrated by the numerical examples in Table 
3.7.1. 

1. If there is no dominance, there is no mean change in phenotype with 
inbreeding regardless of the amount of epistasis (models 1 and 2.). If D,, 
Ds, J, K, and L are 0, Y is not a function of f. 

2. If there is dominance, but no epistasis, the effect of inbreeding on the 
phenotype is linear in f. Usually inbreeding leads to a change in quantitative 
measures and if there is no epistasis this change is proportional to f (models 
3and4) If I = J = K = L —0,the term in f? drops out. 

3. If there is both dominance and epistasis the inbreeding effect may be 
quadratic in f (or higher order if more than two loci are involved). With 
reinforcing type epistasis the inbreeding effect is greater than if the loci were 
additive; with diminishing epistasis the effect is less (models 5 and 6). How- 
ever, the change in average phenotype with inbreeding is not necessarily 
quadratic (models 7 and 8); as long as J = 0, the inbreeding effect is linear. 

Reliable data on the results of inbreeding uncomplicated by the effects 
of selection are rare. There is also a difficulty in choosing an appropriate 
scale of measurement if the linearity of the inbreeding effect is to be tested. 
Some of the best data come from maize and Table 3.7.2 shows an example. 
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The yield was measured for inbred lines, crosses, and randomly mated prog- 
eny from fields of crosses. 


Table 3.7.2. The average yield in bushels per acre of randomly pollinated maize 
derived from hybrids between inbred lines. Expected yields are based on assumption 
of no epistasis. Data from Neal (1935). 


RANDOMLY POLLINATED 
HYBRID INBRED 


AVERAGE AVERAGE 





f=0 f=1 f eu: OBSERVED 
(G) (G — H) (G — Hf) YIELD 
10 two-way hybrids 62.8 23.7 .500 43.3 44.2 
4 three-way hybrids 64.2 23.8 .375 49.] 49.3 
10 four-way hybrids 64.1 25.0 .250 54.3 54.0 


Three kinds of crosses were tested, two-way, three-way, and four way (or 
double-cross) hybrids. Let A, B, C, and D stand for four lines that have been 
self-fertilized long enough to be regarded as completely homozygous. Two- 
way crosses are first generation hybrids between two lines, e.g., 4 x B; 
three-way crosses are between a hybrid and a different inbred, e.g., (A x B) 
x C; four-way crosses are between two different hybrids, e.g., (A x B) 
x (C x D). 

If a field of two-way hybrids is allowed to pollinate at random the prob- 
ability that two alleles have come from the same parental inbred line is 1/2. 
Assuming the parent line to be autozygous, two alleles from the same line 
are identical; hence the progeny from random pollination have an inbreeding 
coefficient of 1/2. For a four-way cross the probability of two alleles from 
the same line is 1/4. For the three-way cross, (A x B) x C, the probability 
of two alleles both coming from C is 1/4, from A is 1/16, and from B is 1/16; 
fis the sum, or 3/8. 

The data in Table 3.7.2. show the close agreement with the expected 
values. Since the inbreeding effect is so nearly linear with the inbreeding 
coefficient, this implies that epistasis is not very important in corn yield. 
Either the genes at different loci act additively on yield or opposite interactive 
effects cancel each other. 

Figure 3.7.1 shows data from Drosophila gathered in a different way. 
In these experiments the cultures were maintained with as little natural selec- 
tion as possible and the chromosome number 2 was kept heterozygous. 
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Then after several generations the chromosome, with its accumulated reces- 
sive mutations, was made homozygous. That there is an appreciable syner- 
gistic effect is clear from the graph. The mutants presumably accumulate 
linearly with time, but the homozygous viability decreases somewhat more 
than linearly. The graph shows only the influence of mutations with small 
effects; chromosomes with lethal mutations are not included. 


Homozygous Viability. V 
o o 
b [5] 
i 


0.3- 


0.2 - 


0.1 - 





o 10 20 30 40 50 60 
Generation, t 


Figure 3.7.1. The viability of Drosophila homozygous for 
the second chromosome as a function of the number of 
generations during which mutants were permitted to 
accumulate. All lethal chromosomes are omitted. There is 
clearly some reinforcing epistasis for the deleterious effects 
of the accumulated mutations. Data from T. Mukai. 


3.8 Regular Systems of Inbreeding 


In Section 3.4 we derived methods for computing the coefficients of inbreeding 
and consanguinity from pedigrees. The same methods can be used to derive 
recurrence relations for f in successive generations with regular systems of 
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inbreeding. A detailed treatement of this subject was first given by Wright 
(1921) and recent summaries are available (Wright, 1951; Li, 1955a). We 
shall illustrate only some of the simpler, but most important cases. 


1. Self-fertilization From Figure 3.8.1 and the principles already 


Generation 


-1 


oN, | 
Sly 
C) t 


Figure 3.8.1. A 
diagram of self- 
fertilization. 
Zygotes are shown 
in large circles; 
gametes by dots. 


discussed in Section 3.4, 


f, 5a f) 3.8.1 


where /, is the inbreeding coefficient in generation t and /,_, is the coefficient 
one generation earlier. To obtain the change in heterozygosity we utilize the 
relation f, = (Hy — H,)/H> from equation 3.2.4, where H, and Hj, are the 
proportions of heterozygosity in generation t and initially (t = 0) . This leads 
to 


H, — 1H,-, 
= (4)°H,_2 = Q)' He. 
This confirms the result stated in Section 3.1; with self-fertilization the 
amount of heterozygosity is reduced by one-half each generation. After 10 
generations, only 1/1024 of the loci that previously were heterozygous remain 


heterozygous. 
For a matrix treatment of this, see A.7. 


3.8.2 


2. Sib Mating  Recalling previous definitions, the inbreeding coefficient 
J, is the probability that two homologous genes in an individual Z are identical 
by descent; the coefficient of consanguinity fj, is the probability that two 
homologous genes, one chosen at random from individual / and the other 
from individual J are identical. 
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We let f, be the inbreeding coefficient of an individual in generation t 
and g, the coefficient of consanguinity of two individuals (necessarily sibs 
after the first generation). The inbreeding coefficient of an individual is the 
same as the coefficient of consanguinity of his parents; that is, f, — g,. ,. 


5 eneration 
cl 


bes 


Figure 3.8.2. Continued 
sib mating. 


t 


T 
[ea 
i 
L| 


Two genes in different individuals in generation t came from the same indi- 
vidual in generation t — ] with probability 1/2, in which case their probability 
of identity is (1 + f, ,)/2. They came from different individuals also with 
probability 1/2, in which case the probability of identity is g,_,. Putting these 
together, we have 


Si = gi 
g= kl tf-1) + ig: 


We can, if we wish, eliminate the g’s from these two equations and write 
a relation involving only the inbreeding coefficients. 


fi = MI 2f 0 fi 3.8.4 


Letting f, = (Ho — H)/H,, we can obtain a recurrence relationship for the 
heterozygosity 


3.8.3 


H, — 1H,., +4H,-2. 3.8.5 


If Hy = H, = 1/2, we obtain the sequence 3/8, 5/16, 8/32, 13/64, etc., 
as given in Table 3.1.2. 

In order to get a more general expression for the heterozygote frequency, 
H, , in terms of the initial frequency, Ho, we let f, = 1 — ^, and g, =1 —k,. 
Then equations 3.8.3 become 


Jt, = Kou 
k, = 3h. + 1k, 


3.8.6 
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or, in matrix form, 
tc) = 6 207) 
k, ł i k,, 
alaa] 
4 4 Wy. 
The rules for matrix multiplication are given in A.7. However, there is a 
standard procedure for obtaining h, directly without having to proceed gener- 


ation by generation. For an explanation using this same example see A.8. 
The characteristic equation for this matrix is 


3.8.6a 


0-4À l 


=0, 
4 1-4 








which leads to 
A? —44-1=0, 3.8.7 


of which the largest (and only positive) root is 


A= : 2 = 0.809. 3.8.8 





Using both roots we can write an exact expression for h, for any genera- 
tion, t. This is shown with this same example in the appendix starting with 
equation A.8.13. 

We are especially interested in the limiting rate of change in heterozy- 
gosity, which is given by the largest root, 3.8.8. Thus, after t is sufficiently 
large, ^, is approximately Ah,_,, which is equivalent to 


H, =~ ÀH, ., = 0.809 H, 1. 3.8.9 


Comparison with the actual values in Table 3.1.2 shows that this equation 
becomes exceedingly accurate after about five to six generations. 

We can also obtain A directly and simply from 3.8.5. Dividing both 
sides of the equation by H,. ,, and letting H,/H,. , = H,-,/H,~2 = A (which 
of course implies a constant rate of decrease in heterozygosity), we obtain 
equation 3.8.7. directly. This procedure has been regularly used by Wright 
(1933a, 1951). 

If we desire a system for comparison of two mating systems in regard 
to their ultimate progress toward homozygosity, we can ask how many gener- 
ations are required by the two systems to achieve the same reduction in 
heterozygosity. We can do this by equating (4,)"! to (4,5)" where t is the number 
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of generations to achieve a given level of change in heterozygosity and the 
subscripts refer to the two mating systems. Writing 


AM = AP 3.8.10 
and taking logarithms of both sides, we obtain 


ti, Tog 4; 3.8.11 
t, loga, ia 





For example, with self-fertilization A = .S while with sib mating A = .809. 
This leads to 


t, log.500  —.3010 


= = = 3.27. 
t, log.809  —.0921 








So we can say that asymptotically one generation of self-fertilization is equal 
to a little more than three generations of sib mating. 

This method of comparing mating systems was first used by Fisher (1949). 
There are of course many properties of a mating system other than the rate 
of change in heterozygosity. These can be studied by writing out all possible 
matings and noting how their frequencies change in successive generations. 
This quickly becomes unmanageable unless matrix methods are used. The 
matrix procedure first introduced by Haldane (1937a) and further developed 
by Fisher (1949) can be used to obtain a general solution. 


3. More Complicated Systems Figure 3.8.3 shows a pedigree in which, 
starting with the third generation, all matings are between double first cousins. 


A 8 C D 


Figure 3.8.3. Repeated mating of 
double first cousins. This is the 
maximum avoidance of inbreeding in a 
population of size 4. A = .920. 
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To write equations for this system we let f, be the coefficient of inbreeding 
of an individual in generation t, as before. Notice that there are two kinds of 
relationship in generation t. We let g, be the coefficient of consanguinity of 
nonsibs such as 4 and B, and j, be that of sibs such as A and C. Thus 


9. — fan = fac = fcn = fan» 


Ji = fac = fap . 

From these definitions it follows that: 

f = Gr-1> 

Jı =4G91-1 Ti. 3.8.12 


Jj = 48 + if-)-1ig-: 
Letting f= 1 — h, g =1 — k, and j = 1 — m in 3.8.12, we obtain 
h,= ks 
k= lk, ., +4m,_, 3.8.13 
m, = ih, ., + dk, ,. 


Table 3.8.1. Decrease of heterozygosity with four mating systems, starting from a 
randomly mating population. The numbers are the ratio of the heterozygosity in 
generation / to the original heterozygosity, H,/Ho = h. 


TT a DOUBLE FIRST- CIRCULAR 
GENERATION FERTILIZATION MATING cor hie 
; AE pm MATING MATING 
N=4 N=4 
0 1.000 1.000 1.000 1.000 
l .500 1.000 1.000 1.000 
2 .250 .750 1.000 .875 
3 .125 .625 .875 .813 
4 .063 .500 .813 .750 
5 .031 .406 .750 .695 
6 .016 .328 .688 .644 
10 .001 .141 .492 .471 
15 .048 .324 .327 
20 .017 .213 .224 
30 .002 .092 .105 
50 .017 .023 





A .500 .809 .920 927 
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These are now homogeneous and can be written in matrix form 


h, O 1 OV/h., 
k |21|0 4 1][k.-, 


m, à 4 0/yn., 3.8.13a 
The characteristic equation is 
—4 l 0 
0 4-A 4 =0, 3.8.14 
i 1j A 
which, upon expansion, becomes 
4? — 44? — 44 - } = 0, 3.8.14a 


and the largest root is A = .9196. 

Some numerical values for heterozygosity with this mating system are 
given in Table 3.8.1. 

Notice that for a population of size 4 this is the system of mating in 
which mated pairs are least related. A corresponding system in a population 
of size 8 would be quadruple second-cousin mating. Wright (1921) designated 
such systems as having maximum avoidance of inbreeding. 

Such systems do, in fact, minimize the rate of approach to homozygosity 
during the initial generations, but somewhat surprisingly there are systems 
of mating that ultimately have a slower rate of decrease in heterozygosity. 
An example, for a population of 4, is half-sib mating, or circular mating, as 
illustrated in Figure 3.8.4. 

Letting g, be the coefficient of consanguinity of individuals one position 
apart and j, be that for individuals two positions apart, 


9: = fan = fec = fep = fap 


A 8 C D 


Figure 3.8.4. Half-sib mating in a 
population of 4, or circular mating. 
à = .927. 


92 AN INTRODUCTION TO POPULATION GENETICS THEORY 


and 
Je = fac = fap; 
we have 
Á4-749-» 
9: =444+ Mic 39a + dhe, 3.8.15 


j= 19.-1 + lj 


Substituting f= 1 — h, k = 1 — g, and m = | — j as before leads to 


h, 0 1 O\ /A,-, 
i = f 4 ] (e | 3.8.16 
m, 0 3 i um. 


with the characteristic equation 
V-17+7;=0 3.8.17 
and the largest root is 


A = .9273. 


Notice that the eventual rate of decrease in heterozygosity is less in this 
system than with double first-cousin mating. Referring to Table 3.8.1, we 
see that the heterozygosity curves for the two systems cross at about the 
fifteenth generation. The general principle is that more intense inbreeding 
produces a lower ultimate rate of decrease in heterozygosity, provided that 
there is no permanent splitting of the population into isolated lines. Con- 
versely, a system that avoids mating of relatives for as long as possible does 
so at the expense of a more rapid final approach to homozygosis. The breeder 
therefore may choose a different system of mating if he is more interested 
in maximum heterozygosity during the initial generations than in the long- 
time future population. An extension of the procedures of this section to larger 
populations than N = 4 has been given by Kimura and Crow (1963). Robertson 
(1964) and Wright (1965a) have shown that many of these results can be 
brought together very generally under a single point of view. For other types 
of mating systems see Wright (1921, 1951). Many of Wright’s earlier results 
are summarized by Li (1955). 


4. Partial Self-fertilization All the examples discussed thus far lead 
eventually to complete homozygosity. This is not always the case, and we 
shall now consider one such example. This is the simple, yet important, case 
where a certain fraction each generation are self-fertilized and the remainder 
are mated at random, a situation found in several plant species. 
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Let S be the fraction of the population that is produced by self-fertiliza- 
tion; then 1 — S is the fraction that is produced by random mating. From 
3.8.1 we can write the expected recurrence relation for f as 


f.- St + f.-002] + (1 — SX) = > ERE you 


This assumes that the plants to be self-fertilized each generation are a random 
sample of the population; for example, there is no tendency for the progeny 
of self-fertilized plants to be self-fertilized. 

Substituting f, = (Ho — H,)/ Hg from 3.2.4 into 3.8.18 we get 


S 
H, = Ho(l - S) + 5 Hi. 3.8.18a 


Subtracting 2(1 — S)Ho/(2 — S) from both sides and simplifying, 











= o> H, -$ |r,- E i Ho 
= (5) In... - -9 Ho 3.8.19 





= (5) EZ = ea tol. 





0 5 10 15 20 
t 


Figure 3.8.5. Change in heterozygosity with four mating systems. 

A. Self-fertilization; B. Sib mating; C. Double first-cousin mating; 

D. Circular half-sib mating. The ordinate is the heterozygosity relative to 
the starting population; the abscissa is the time in generations. 
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Since (S/2)' approaches 0 as t becomes large, the heterozygosity approaches 
a limit where the heterozygosity is a fraction 2(1 — S)/(2 — S) of its original 
value. The rate of approach is such that the departure from the equilibrium 
value is decreased by a fraction 1 — S/2 each generation. Notice that when 
S = | we get the usual formula for self-fertilization. 

This situation is striking in that unless S is large there is almost no 
cumulative effect ; most of the effect occurs in the first generation. For example, 
with 10% self-fertilization, the initial heterozygosity is reduced by 5% in 
the first generation, but even when equilibrium is reached the reduction is 
only 5.3%! 


5. Repeated Backcrossing to the Same Strain Frequently a plant 
breeder may wish to introduce one or more dominant genes from an extrane- 
ous source into a standard variety. For example he may have a highly desirable 
variety, except for its being susceptible to some disease. The resistant gene 
may exist in another strain which is less desirable in other respects. He can 
introduce this gene by crossing the two strains and then repeatedly crossing 
resistant plants to the susceptible strain. In this way the resistant gene is 
inserted into a genetic background that becomes more like the susceptible 
strain with each backcross. As another example, a mouse breeder may wish 
to introgress a new histocompatibility gene into a standard inbred strain. 

It is clear that in recurrent backcrossing the number of loci that contain 
genes from both strains is reduced by half each generation. Thus, after ¢ 
generations, a fraction equal to 1 — .5‘ are from the recurrent parental 
strain. After seven generations less than 1 % of the loci contain a gene from 
the other parental strain. If the recurrent parental strain is homozygous, 
the heterozygosity willreduce by half each generation, as with self-fertilization. 

However, genes that are linked to the resistance or histocompatibility 
gene will tend to remain heterozygous. The question of how large a linked 
region will remain after a certain number of generations of backcrossing has 
been investigated by Haldane (1936) and Fisher (1949). 

Consider a chromosome segment on one side of the selected factor and 
let the length of the segment be 100x map units in length (see Figure 3.8.6). 


Selected 
Locus 


| 


—— M ————————— 
X o EO 


Figure 3.8.6. A chromosome segment. One locus is selected 
during recurrent backcrossing and the problem is to deter- 
mine the length of chromosome to the right of this locus that 
will be intact after ¢ generations. 
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If there is no interference, the probability of no crossover in this interval in 
one generation is e * (see Appendix A.5.6). The probability of no crossover 
in t generations is e '*. The chance of a crossover in the small interval x to 
X +dx is dx, if we take this interval small enough that multiple crossovers 
can be ignored. The probability of a crossover in the interval dx sometime 
during t generations is tdx. Thus the probability after t generations of having 
had a crossover in the interval dx but not in the interval x is e^ "tdx. Then the 
mean value of the intact interval x is 


S 1 
X= Í e "tx dx = P 3.8.20 
0 


or 100/t map units. 

For example, after 20 generations the average segment remaining intact 
would be 5 units on each side of the selected locus, or 10 units altogether. 
The derivation has assumed no interference, but for short regions the interfer- 
ence pattern makes very little difference. 

There is a closely related question that can be answered very simply. 
We ask: What is the mean number of backcross generations that a gene will 
remain linked to the selected locus when the recombination probability is 
r? The number may be derived as follows. The probability that the genes will 
remain linked for t generations and then recombine in the next generation 
is (1 — r)'r. The average number of generations until they become separated 
is 


i-r-42(1—- r)r - X1- rr c 4(1- rfr: 
= r(14 2y + 3y?+4y?4---), 
where y = 1 — r. 
Notice that the quantity in parentheses is the derivative of the series 
l+y +y? typi ytt 1/(1 — y). Therefore 


i d (= ) -——— : 3.8.21 
= y — => — fe 
dy \1— y (1—y»? r 


The value of x (the map distance in 3.8.20) and r (the recombination 
value in 3.8.21) will not in general be the same, but will become more and 
more similar as x becomes small enough that multiple crossovers can be 
neglected. 


3.9 Inbreeding with Two Loci 


The idea of the inbreeding coefficient can be extended to cover multiple loci. 
The principal item of interest is that inbreeding may cause the association of 
two or more recessive traits. This can happen in either of two ways: (1) with 
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unlinked loci but nonuniform inbreeding, and (2) with uniform inbreeding and 
linkage—and, of course, because of both linkage and nonuniform inbreeding. 

Our treatment follows very closely the method of Haldane (1949). We 
consider first unlinked loci in linkage equilibrium. 

Let p; be the frequency of allele A; and r, that of the independent allele 
B,. The frequency of 4,4; homozygotes (from 3.22) is p'(1 —/) + p; f= 
p? + p(1 —p)f with a similar formula for B, B,, rZ + r(1 — rj)f. 

Since the loci are in linkage equilibrium the frequency of A; A; B, B, 
is the product, or 


(p? paf Kr? + rsf) = p?r? + (pgr? + p?rsy + pqrsf?, 


where q = 1 — p and s = 1 — r, and the subscripts have been dropped for 
simplicity since we are discussing only one allele at each locus. 

In a population with different values of f from individual to individual 
the frequency of the double homozygote, P(AABB), is 


pr) + (pqr? + prs + parsf?. 


But, by the definition of the variance 


P= +y, 
Hence 
P(AABB) = (p? + paf Xr? + rsf) + pars V}. 3.9.1 


Suppose that in the human population two recessive genes each have a 
frequency of .01, and that 1 7; of the marriages are between cousins while 
the rest are random. Then f= 1/1600, V, = 1/25600 — (1/1600)? = 3.87 
x 1075, (p? + pafY(r? + rsf) = (00011)? = 11.28 x 107°, and pgrsV, = 
3.79 x 107°. Hence the frequency of double homozygotes in the population 
is 15.07 x 107°, just about 4/3 what it would be if all families had the same 
inbreeding coefficient (V, — 0). 

To consider the second case we extend the inbreeding and consanguinity 
coefficient, f, to two loci. Let F be the probability that both of two loci carry 
identical alleles. F will be a function, not only of the pedigree, but also of the 
amount of recombination between the two loci. 

A pair of uniting gametes can be: (1) identical for both loci with proba- 
bility F; (2) identical for only the A locus, probability f — F; (3) identical for 
only the B locus, also f — F; or (4) identical for neither, with probability 
| — 2f +F. 

If the population has reached equilibrium with respect to the linkage 
phases, the frequency of AB gametes will be the product of the frequencies 
of the A and B genes. 
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Thus 
P(AABB) = prF + pr*(f — F) + p’r(f— F) + p?r?(1 — 2f + F) 
= pr(Fqs + far + fps + pr) 
= pr[(asf^ + far + fps + pr) + Fas — f?qs] 
= (p^ + fpaXr^ + frs) + ópars, 
where ¢ = F — f?. The frequency of any other genotype is obtainable the 
same way. For a summary table see Haldane (1949). 
The quantity @ measures the degree of association in identity beyond 
that which would occur if the loci were independent. A comparison of 3.9.1 


and 3.9.2 shows that $ and V, enter the formula the same way. The effects 
are approximately additive, so that, 


3.9.2 


P(AABB) = (p? + fpqXr? + frs) + (6 + V,)pqrs 3.9.3 


with analogous expressions for other genotypes. 

There remains the problem of computing F (or $) from a pedigree. There 
is no simple algorithm for F comparable to that for f. However, Denniston 
(1967) has discovered a method for computing F for any degree of relation- 
ship. The simple relationships are not hard to do by strong-arm methods. 

Consider first two gametes produced from the same individual. They 
will be identical if both sets of alleles are derived from the same chromosome, 
there having been no recombination between A and B, orif the A’scomefrom 
one chromosome and the B’s from the other, which demands that each gamete 
be the result of a recombination. Letting c be the frequency of recombinant 
gametes, and d the frequency of nonrecombinant (c + d = 1), the probability 
of double autozygosity is 


F= Y(c? + d?), 
J= 1, 
$ = ic — ay. 


With parent-offspring mating, in order to have double autozygosity one 
of the identical gametes from the common ancestor must pass intact through 
another generation, which has probability d/2. Therefore 


F = lc? + a?)d, 


f=4, 
= kld — c)(3d? + c°). 
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For half-sibs, each of two gametes must pass intact through a generation, 
which has probability d?/4. Thus 


F= (c? + a?)a?, 

f=, 

$ = (8d? + 8c!d? — 1). 

With full-sib mating a complication arises since there must be considera- 
tion of the possibility that the A’s come from one grandparent and the B's 


from the other. For the details we refer to Haldane's paper; his results for 
several kinds of relationship are given in Table 3.9.1. 


Table 3.9.1. Values of the inbreeding coefficient for two loci, where c is the pro- 
portion of recombination and d = 1 — c. The parents are not inbred and are in 
linkage equilibrium. From Haldane (1949). 





RELATIONSHIP 
OF PARENTS å f peu 

Identical He? + d? i e-d}? 
Parent-offspring | i(e? + d?)d i js(d—oe)3d + c?) 
Full sibs sQa^ + 2c!d? + c?) X qs(4d* + dc?d? + 2c? — 1) 
Half-sibs &(c? + d?)d? i d(8cua?-484*— 1) 
Uncle-niece qig(2d* + 2d?c? + c?)d i (8d? + 8c?d? + 4c?d — 1) 
First cousins gig(2d* + 2c?d? + ¢*)d? 4 3is(1645-- 16e?d^ + 8c?d? — 1) 
Double 


half-cousins = rig(845 + 8c?d^ + c?) 7e 7is5(164d5 + 16c?d^ + 2c? — 1) 


It is interesting that several relationships with the same f may have differ- 
ent F’s, Compare, for example, ordinary first cousins with double half- 
cousins. 

As a numerical example, assume p = r = .0!. The frequency of AA or 
BB in the progeny of cousins is .00072 compared with .00010 with random 
mating. In the absence of linkage the frequency of AABB is (7.2 x 10~*)? = 
5.2 x 1077; with 10% recombination the frequency is 5.2 x 107’ + ópgrs = 
34.6 x 1077, 

However, such a sharp rise is expected only with rare genes and close 
linkage. Some association of rare recessive traits should be expected as a 
consequence of inbreeding and linkage, but probably not of sufficient magni- 
tude to be detected in ordinary circumstances. 
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The effect of inbreeding on quantitative traits involving two loci when 
epistasis and linkage disequilibrium are present may be obtained by using 
expressions like 3.9.3. for all the genotypes. Nei (1965) did this for the mean 
fitness, as an extension of 3.6.3, which was derived by assuming linkage 
equilibrium. 


3.10 Effect of Inbreeding on the Variance 


In Section 3.6 we considered the effect of inbreeding on quantitative traits, 
especially as the mean is affected. In addition to its effect on the population 
mean, inbreeding also has an effect on the variance. 

Consider again the single-locus model of Section 3.6. It is convenient 
to let Y = 0, as this does not change the conclusion and saves some trouble- 
some algebra. To assess the total effect we consider first a single locus. 


GENOTYPE A\A, Aid? A.A, 


FREQUENCY pPl—-N+pf ?2pp0—f) pil—-N+pif 
CONTRIBUTION TO ue m 4 
TOTAL PHENOTYPE 


We write the mean as a function of f, 
Y,-( —fXp2 A + 2pyp; D — pi A) + f(p3 A — pA) 
= (1 — f)Y, +fY,, 


where Y, is the mean value with random mating (f= 0) and Y, is the mean 
value with complete homozygosity (f= 1). 
Notice that this can be written as 


Y= Y, + (Y, — Y, 3.10.2 


3.10.1 


illustrating once again, as was shown in Section 3.6, that the phenotype is 
a linear function of f. Equation 3.10.2 is the same as 3.6.1. 

We now write the expression for the contribution of this locus to the 
total population variance. 


V, = (1 —f (pj A? + 2p; P2 D? + pj A’) + f(p, 4? + p; A’) - Y? 
- (1 — £V, + Y3) * f(Vi + YD - [(1 2 Yo + / Y? 3.10.3 
-(I— f)Vo * fV, +f — £X Y, — Yy', 


where V, and V, are the variance with random mating ( f — 0) and complete 
inbreeding (f= 1). This formula comes from Wright (1951). 
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This shows that, unlike the mean, the variance is not a linear function 
of f but is quadratic. 

Later, in considering the effects of selection, we shall see that the rate of 
gene frequency change depends on the additive component of the gene effect. 
Therefore it is of special interest to examine the effect of inbreeding on the 
mean and variance of a locus without dominance. 

Without dominance there is no effect of inbreeding on the mean, as can 
be seen by letting D = 0 in 3.10.1 or 3.6.1. Y = Y, = A(p; — p), 


Vo = pj A? + p} A? — Y2 = 2p, p; A’, 3.10.4 

V, = pA? + p A? — Y? = 4p, p, A? = 2V,. 3.10.5 
Thus, with no dominance the total variance is 

V, — (1 — f)Vo + fV, = Vo +S); 3.10.6 


which in this case is linear rather than quadratic. A population within which 
there is some consanguineous mating has an increase in genetic variance 
proportional to f, provided there is no dominance. 

If there is subdivision of the population into inbred strains, we can 
measure the variance within and between such strains. From 3.10.4. we see 
that the variance of a randomly mating population for a nondominant locus 
is proportional to the amount of heterozygosity, measured by 2p,p;. In a 
strain of inbreeding coefficient f the heterozygosity is reduced by a fraction 
J; hence the variance is reduced by this amount. 

The effect of any inbreeding which is of a type that divides the population 
into isolated groups will be to decrease the variance within groups, to increase 
the variance between groups, and increase the total variance. This happens, 
for example, in the development of a series of inbred lines of plants or 
livestock. 

These conclusions apply only to the special case of no dominance. With 
varying degrees of dominance explicit formulae for variance within and 
between lines become more difficult (Robertson, 1952; Wright, 1952). This 
is discussed in Chapter 7. 

A case of special interest arises when there are recessive alleles of low 
frequency. In this circumstance the variance within a subpopulation increases 
during the early stages of the inbreeding process, despite the fact that the 
population is becoming more homozygous. The reason is that inbreeding 
brings out previously hidden recessive factors which now can contribute to 
the phenotypic variance. This increase will continue until it is eventually 
offset by the increasing homogeneity in gene content from individual to in- 
dividual within the subpopulation. 
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3.11 The Inbreeding Effect of a Finite Population 


As was mentioned in the introduction to this chapter, there is a decrease 
of heterozygosity in a finite population even if there is random mating within 
the group. Each generation may be regarded as being made up of 2N gametes 
drawn from the previous generation and which combine to make the N 
individuals of this generation. The gene frequency will therefore change 
somewhat, the amount depending on the smallness of the population. The 
change will have a variance of p(1 — p)/2N where p is the frequency of the 
gene under consideration in the parents. We have assumed here that any 
particular successful gamete is equally likely to have come from any one of 
the parents, so that the situation is analogous to binomial sampling of 2N 
gametes (see A.5.5). 

It might not seem obvious that random changes in gene frequency, which 
can be in either direction, will on the average cause a net decrease in hetero- 
zygosity and increase in homozygosity. One way to visualize this is to regard 
each of the 2N genes at a locus in the N parents as individually labeled. The 
effect of random processes in drawing (with replacement) from these is that 
some will be omitted entirely while others will be drawn more than once. 
Thus there will be a certain amount of identity next generation. If the process 
continues long enough, all the genes will be descended from a single individ- 
ual gene and complete autozygosity will be attained. 

The distribution of the probabilities of various gene frequencies during 
this process is a difficult problem. On the other hand, the average change in 
heterozygosity is uniform and easily derived. It can be expressed as a function 
of the inbreeding coefficient. 

Consider first a population with completely random mating, including 
self-fertilization. As before, we consider a single locus. Imagine that the 
progeny are produced by drawing random pairs of gametes from an infinite 
pool to which each parent had contributed equally (or, if the pool is finite, 
the drawing is with replacement). Two gametes then have a chance 1/2N of 
carrying identical genes, since the N diploid parents have 2N genes at this locus. 
Two gametes have a chance of | — 1/2N of carrying different parental genes. 

In the first case, the probability of the genes being identical is of course 1. 
In the second case the probability of their being identical is f,_,, the inbreed- 
ing coefficient of an average individual in the previous generation. The reason 
for this is that the two alleles were drawn at random from the parent generation, 
and since the parents were the result of random mating, the probability of 
any two alleles being identical is the same as that for two in the same zygote; 
the latter is, by definition, f,..,. Therefore, 


l 1 
hasy (i - gs) 3.11.1 
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Recalling that H,, the heterozygosity at time t, is Ho(1 — f), we obtain 
by substitution in 3.11.1 


1 Iy 
H= (1 gs) = (1-3) Ho. 3.11.2 


The result is the very simple one ; despite the complexities in the changes 
in gene frequencies, the average heterozygosity decreases by a fraction 1/2N 
each generation. 

As stated earlier, this formula assumes completely random mating (that 
is, random combination of gametes) including the possibility of self-fertiliza- 
tion. Note that, when N = 1, the results agree with the formula for self- 
fertilization, as expected (3.8.1 and 3.8.2). 


No Self-fertilization We now assume that gametes are combined at 
random, but with the restriction that two uniting gametes cannot come from 
the same parent. We let f, be the inbreeding coefficient at time t, as before, 
and let g, be the coefficient of consanguinity of two different randomly 
chosen individuals in generation t. The inbreeding coefficient in generation t 
is clearly the same as the consanguinity coefficient in generation t — 1, since 
mating is at random. To get the consanguinity coefficient in generation t, 
we note that the two chosen genes (one from each individual) have come 
from the same individual in the previous generation with probability 1/N 
and from different individuals with probability 1 — 1/N. In the first case the 
probability of identity is (1 + /, .,)/2 as explained earlier (see, for example, 
3.4.1 and 3.8.1); in the second case it is g,_,. Putting these together, we obtain 


Si = i-is 
N-I 3.11.3 
N 91-1: 


Substituting the first into thc second gives 





l ! 
g = ay 0 Tf-)0- 


l ad 
fm CFL tS, 


which, after going back one generation and rearranging, is 


f =f- Tb 2fi-1 tf -32N. 3.11.4 


Separate Sexes |f the individuals in generation ! — 1 consist of N,, 
males and N, females adding up to a total of N individuals, only a slight 
modification is required. The two sexes necessarily make the same contribu- 
tion to later generations, since each fertilization event involves one maternal 
and one paternal gamete. Therefore the probability that two genes in different 
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individuals in generation t are both derived from a male in generation t — I 
is 1/4; and that they came from the same male is 1/4N,,. Likewise, the proba- 
bility of their coming from the same female is 1/4N , . Then the probability that 
the two genes came from the same individual in generation t — I is 


] & 1 1 

4N, 4N, Në 
where N, is the effective number of individuals in generation t — 1. 

This means that equations 3.11.3 and 3.11.4 are correct with separate 
sexes; it is only necessary to use N, instead of N in the formula. Notice that 
when N, = N; = N/2, then N, = N. When the two sexes are equally frequent, 
the actual number and the effective number are the same. 

The effective population number is used with a wider definition than this 
single example would suggest. If there are fluctuations in the population 
number from time to time, or if the distribution of number of progeny per 
parent is nonbinomial, or if there is any other kind of deviation from the 
idealized model that we have assumed, then it is conventional to define for 
that population an effective number. The effective number then, as in this 
example, is the size of an ideally behaving population that would have the 
same homozygosity increase as the observed population. We shall return to 
this subject in Section 3.13 and in Chapter 7. 

Notice that, when the number of males and females is equal, there is no 
distinction (as far as rate of homozygosity change caused by random gene- 
frequency drift) between a population with separate sexes and a hermaph- 
roditic population without self-fertilization. 

Returning to equation 3.11.4 we can make the same substitution as 
before, H, = Hg(1 —/J,), and write an equation for the heterozygosity in 
successive generations. This yields 

Soc tag Hen. 3.11.6 

Notice that when N = 2 equations 3.11.4 and 3.11.6 reduce to the 
equations for sib-mating (3.8.4 and 3.8.5) as expected. This comparison may 
be misleading in one regard, however. Part of the increase in homozygosity 
in larger populations is due to the fact that different members of the popula- 
tion leave different numbers of descendants. The general formula takes this 
into account, and with larger N it turns out that about half of the increase 
in homozygosity is caused by the restricted number of parents and the other 
half by differential contribution of offspring. Equation 3.11.6 assumes under 
these circumstances that the distribution of progeny-number is binomial, 
as follows from the assumption that each progeny gene has an equal chance 
of having come from any parent. However, as is seen, the formula is still 


3.11.5 


H, = 
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correct in the limiting case of sib mating, despite the fact that the progeny 
number per parent is necessarily constant in this case, 

The ultimate ratio by which H, decreases each generation is given by the 
larger root of the quadratic equation 


N-I l 
2 — [—— | — — = 0. 11, 
À ( " Ja ON 3.11.7 


This equation is easily obtained by setting H,/H, , = H, ,/H, , =A in 
3.11.6. 
The relevant solution to the equation is 


QNT IE VN 1 


3.11.8 
2N 
or approximately, unless N is very small, 
TRU POOL 3.11.9 
2N +1 
or, when N is large, 
| — à ~ —. 3.11.10 
2N 


Hence, in a moderately large population the average heterozygosity is 
reduced by about 1/2N per generation whether there are separate sexes or 
not, provided that the two sexes are equal in frequency. 

The accuracy of the approximation 3.11.9 for small N can be seen by 
comparison with the data in Table 3.1.2. The last column in this table gives 
the proportion by which the heterozygosity is reduced, which is to be compared 
with the approximation 1/(2N + 1). Within 6 generations the approximation 
is quite good: with N = 2, I/(2N + 1) = 1/5 or .20 whereas the exact answer 
is .19. So if the population is anywhere near a steady state, 3.11.9 is very good 
even for quite small numbers. 

The results of this section were all obtained first by Wright (1931) by his 
method of path coefficients. A summary of this method and its applications 
to problems in population genetics is given in his Galton Lecture (Wright 
1951). 


3.12 Hierarchical Structure of Populations 


At the beginning of this chapter we emphasized that inbreeding can occur 
under two quite different circumstances. In both cases there is a decrease in 
heterozygosity measured by f, but some of the consequences are quite different. 
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In the first case we can have a large population within which isolated 
consanguineous matings occur. The inbreeding coefficient measures the 
average decrease in heterozygosity and equations 3.2.1—3.2.3 give the 
expected genotypic frequencies for given gene frequencies. As soon as random 
mating occurs the inbreeding coefficient returns to zero. An example is the 
case of partial self-fertilization discussed in Section 3.8; as soon as self- 
fertilization is prevented, the original heterozygosity is restored. (We should 
perhaps mention that here, as throughout this chapter, we are ignoring the 
effects of selection and mutation.) 

On the other hand, there may be inbreeding because of restriction of 
population number, even though mating is at random within the population. 
The average heterozygosity within the population is reduced and the in- 
dividuals become more closely related, both measured by f (i.e., f, and fij). 
But, as emphasized in Section 3.11, the increased homozygosity within the 
population is not due to departure from Hardy-Weinberg ratios, but to 
changes in the gene frequencies. The change is such as to make the value of 
2p,p; (or Ep, p; with multiple alleles) decrease in proportion to f. The formulae 
3.2.1-3.2.3 hold only in the sense of giving the genotype frequencies averaged 
over a whole series of such populations. 

In this situation the loss of heterozygosity within the population is 
permanent and could be restored only by crossing with other populations. 
Repeated self-fertilization and sib mating constitute extreme cases of small 
populations. 

There are circumstances in which the two effects are combined. For 
example, we might inquire about the inbreeding coefficient of an animal 
whose parents were sibs in an isolated population of effective size N. His 
homozygosity will be greater than if his parents were sibs in a large population. 
We now derive a procedure to handle this situation. The conclusions were 
first reached by Wright (1943, 1951) by a different method. 

Let S be a subpopulation derived by isolating a finite number of in- 
dividuals from a large total population 7. For example, S could be a breed 
or a strain isolated from a foundation stock T. Or, S could be one of a series 
of geographically isolated subpopulations of a large population 7. Let / be an 
individual within subpopulation S. 

We now define frs as the probability that two homologous genes in / are 
derived from the same gene in a common ancestor within the subpopulation. 
Let f5, be the probability that two homologous genes, chosen at random 
from the subpopulation, are both descended from a gene in the subpopulation. 
We let fır be the overall probability of identity in individual 7. 

The probability of nonidentity is then the product of two terms: (1 — ffs), 
the probability that the two genes do not both come from a gene in a known 
common ancestor in the population, and (1 — /57), the probability that if the 
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two genes are randomly chosen from within the population they will not be 
identical because of some more remote relationship. Therefore, 


l — fir = (1 — fisXl — fs). 3.121 


or 


Sir 9 fsr + (l — fsrYis- 3.12.1a 


For example, what is the inbreeding coefficient of a child whose parents 
were cousins on an island whose population is descended from a shipwreck 
10 generations ago? Assume for simplicity that there were 50 survivors, 
equally divided between the two sexes, and that the population has remained 
of this size and sex distribution since that time. From equation 3.11.2 
(accurate enough, although it would be better to use 2N + 1 instead of 2N) 
we have 

] 


10 
— fsp=(1-—) =0.90 
je ( Td 9045 


and 
1 f =) : = 0.9375 
IS 16 ' , 


1/16 being the inbreeding coefficient of a child of a cousin marriage. From 
3.12.1, 


| — fir = (0.9045)(0.9375) = 0.848 
and 
fir = 0.152, 


compared with 0.0625 for cousin marriage in an infinite population. 

Students of animal breeding will enjoy Wright's (1951 and earlier) 
application of these methods to the history of Shorthorn cattle. He showed 
that there was a substantial increase in the inbreeding coefficient fyr of British 
Shorthorns, almost entirely due to /5; and hardly at all because of con- 
sanguineous matings within the breed ( f;s). 

Similar analyses are possible in human isolates if accurate pedigree rec- 
ords are available. 1f they are not, it is sometimes possible to get reasonably 
satisfactory information from marriage records. Since a person's name is 
inherited as if it were linked to the father's Y chromosome, marriages between 
persons of the same surname (isonymous) can be used as indications of 
consanguinity. Using this procedure on the Hutterite population, Crow 
and Mange (1965) were able to show that fs, in this population is quite 
appreciable, about 4%, but that fis is not significantly different from 0. 
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In other words, the increased homozygosity is due almost entirely to the small 
effective size of the population and not at all to nonrandom marriage within 
the isolate. 

Notice that 3.12.1 may be written as 


Ho(1 — fir) 
Ho - fsr) 


which shows that | — fj; is a measure of the heterozygosity of an individual 
in the subpopulation relative to that of an individual derived from random 
mating in the same subpopulation. 

As was mentioned earlier, the inbreeding coefficient can be described 
in terms of correlation as well as in terms of gene identity. In fact Wright's 
original definition and derivation was through correlation analysis. One 
advantage of the correlation interpretation, as opposed to the probabilistic, 
is that negative values have a meaning. This is especially useful in this section. 
Fjs is the correlation between homologous genes in an individual relative to 
genes chosen at random from his subpopulation. F,7 is the correlation between 
homologous genes in an individual relative to the whole population. Fsz is 
the correlation between randomly chosen genes in the subpopulation relative 
to the total population. š 

Fs, is necessarily positive, but the others need not be. For example, 
if there is specific avoidance of matings between related individuals within 
the subpopulation, Fıs may be negative. 

Wright defined f;; as the correlation between two random gametes 
from the same subpopulation and derived the relation 


1 — fis = 3.12.2 


V, 


p(l— p 
where p and V, are the mean and variance of the gene 4 among the subpopula- 
tions. We can derive this as follows, using 3.12.1. 

Consider a pair of alleles, 4 and A', and let p be the frequency of A. 
Then the frequency of a heterozygote in the subgroup with frequency p 
is 2p(1 — p)(1 — fıs). Thus the frequency of heterozygotes in the total popula- 
tion is 


H = EQp(1 — pY1 — fis) 


where E( } designates taking the expectation or average over all subpopula- 
tions. If the variation of gene frequencies among subgroups is independent of 
fis, then 


H = X1 —fis)E{p(1 — py. 


Sst 


3.12.3 
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But 
E(p(1 — p)} = E(p — p°} 
=p-P-Y, 
(see A.2.2 or 2.9.2). Thus 
H = X1 —/fis)\(p — p? — V,). 3.12.4 
On the other hand, from the definition of fir, 


H = X1 — fil — p). 3.12.5 


Equating these two expressions gives 


V, 
1— =(1- ( — |. 
a- 43-0 - 43|1 - x1 —5 
Comparing this with 3.12.1 gives us the desired expression, 3.12.3, 
Notice that if there is no inbreeding in the subpopulation (fıs = 0), then 


H = 2p(l — p) — 2V,. 3.12.6 


This, although expressed in terms of heterozygotes rather than homozygotes, 
is the same as we described as Wahlund's principle in 2.9.2. 

Nei (1965) extended this procedure to cover the case of multiple alleles 
and has shown that 


Covi; 


Isr aa. 3.12.7 
where p; and p; are the mean frequencies of the alleles 4; and A, and Cov,, is 
the covariance of their frequencies. (See problem 22, Chapter 1.) 

Nei and Imaizumi (1966) applied these formulae to study the differentia- 
tion of ABO blood group gene frequencies among the prefectures of Japan. 
They obtained 6 values of fs that were in close agreement, suggesting that the 
local differentiation of gene frequencies was mainly random. 

Equation 3.12.1. can easily be extended to include subdivisions of a 
subdivision in a hierarchy. For example, 


(1 — fir) =(1 — figa — fasX! — Fsr), 3.12.8 


where R is a subpopulation of S which is a subpopulation of T. Then frs 
is the probability that two randomly chosen homologues in R are identical 
because of common ancestry within R and fs, is the probability that two 
randomly chosen genes in S are identical because of common ancestry 
within S. 
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3.13 Effective Population Number 


In our discussion of random drift in gene frequencies and the resulting 
decrease in heterozygosity in a finite population we have assumed, in addition 
to random mating, that the expected number of progeny is the same for each 
individual. That is to say, we regarded a successful gamete as being equally 
likely to have come from any individual in the parent generation. We also 
assumed that the population size remains constant from generation to genera- 
tion. Neither of these conditions is likely to be met in nature. 

In equation 3.11.5 we introduced the concept of effective population 
number by noting that, if the numbers of males and females differ, we can 
replace N in the formulae such as 3.11.6 by N,, the effective number, where 


1 1 1 


Ne AN, AN, 
or 


ANm Ny 

N, = N +N, 3.13.1 

Notice that the value of N, is influenced much more by the smaller than 
by the larger of N, and Np. For example, if N,,= ! and N; = 100, N. is 
about 4; so, in highly polygynous species the number of males is much more 
important than the number of females in determining the amount of random 
drift in the population. N, is proportional to the harmonic mean of N,, and 
N,, and the harmonic mean is more strongly influenced by the smaller values. 

We can also use the concept of effective population number when the 
true size varies with time. From 3.11.2 we note that the heterozygosity after t 
generations, when N varies from generation to generation, is 


ic - (i-a om) (ta) 
Hy 2No 2N, 2N, 2N,., 


oom) 


We then ask: What population of constant size would have the same decrease 
in heterozygosity over the time period involved? We call this number the 
effective population number. 

To find this, we write 


H, 1 t Tí 1 | 
—-—[1- = ae à 
Ho ( zx) it 2N; 3.13.3 


which can be solved for N,. 


3.13.2 
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Notice that if the N;'s are fairly large and t is small, this equation is 
roughly 


t l 
= =1 Ti 
! 2N 2 2N 
or 
1 lol 
aay, 3.13.4 
N, (^N, 


So, if the population size fluctuates, the effective population number is roughly 
the harmonic mean of the various values. 

In Chapter 2 we showed that with varying population growth rates at 
different times the arithmetic mean is appropriate as a summarizing value if 
fitness is measured by the Malthusian parameter, m (1.2.5), and the geometric 
mean is used if fitness is measured by w (1.1.5). Here we find that still a third 
average, the harmonic mean, serves best as a single representative value. 

' Finally, if the individuals in the population do not have the same 
expected number of progeny, the effective number will be less than the census 
number. If the population size is constant 


4N —2 
e= o+2 3.13.5 
where c2 is the variance in the number of progeny per parent. This is derived, 
along with more general formulae, later in Section 7.6. 

This formula assumes complete random mating, including the possibility 
of self-fertilization. Notice that when the number of progeny per parent has a 
binomial distribution with mean 2 (the mean must be 2 in a sexual population 
that is neither increasing nor decreasing) this is equivalent to drawing a 


sample of 2N gametes randomly from the N parents. The binomial variance 


is 
gea oe 


Substituting this into 3.13.5 gives 
Ne =N, 


as it should. In this ** ideal" population the actual and effective numbers are 
the same. 

Notice that the ideal population model when N, = N does not imply a 
constant number of progeny per parent, but a randomly varying number. If 
the number has greater than binomial variance the effective number is smaller 
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than the census number; if the variance is less than binomial, the effective 
number is larger. An extreme case is that where each parent is constrained 
to have the same number of progeny, as in many livestock or laboratory 
animal breeding systems. If c2 = 0 in 3.13.5 then N, = 2N — 1, almost twice 
the census number. So, with random mating and binomial progeny distribu- 
tion, about half the reduction in heterozygosity is due to consanguinity among 
mates; the other half is caused by variable numbers of progeny. 

In most populations in nature the effective number is less than the census 
number. For a discussion of laboratory and census data on this question, see 
Crow and Morton (1955). 

The concept of effective population number and the formulae in this 
section are all due to Sewall Wright (see especially his 1931 and 19385 papers). 
For extensions to more complicated situations see Crow (1954), Crow and 
Morton (1955), and Kimura and Crow (1963a). We shall return to this subject 
in Section 7.6. 


3.14 Problems 


1. In Table 3.1.1, if the initial frequencies are in Hardy—Weinberg ratios, 
show that in generation t, D, = p(1 — 2^ ‘g), where p and q are the fre- 
quencies of the dominant and recessive genes. 

2. In deriving 3.8.19 we rather arbitrarily subtracted 2(1 — S)Ho/(2 — S) 
from both sides of the equation. Show, by equating H, to H,. ,, that this 
is an equilibrium value (and that therefore the procedure was not arbi- 
trary). 

3. Is individual J inbred ? 


D i ys ye 
N Ni XX 


4. The algorithm for computing the inbreeding (or consanguinity) coefficient 
(3.4.1) permits no individual to be counted twice in the same path. 
Invent an algorithm that can dispense with the (1 + f/4) term by permitting 
the path to pass through some individuals twice. (Figure 3.4.3 provides a 
good example.) 

5. What is the rate of decrease in heterozygosity from mother to daughter 
in mother-son mating in honeybees? Remember that the male bee is 
haploid, being derived parthenogenetically from the mother. Thus the 
rule is the same as for the X chromosome in ordinary species. See Figure 
3.14.1. 
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Figure 3.14.1. 
Repeated 
mother-son 
mating in 
honeybees. 
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6. Compare the results of parent-offspring mating for an autosomal locus 
with that for a sex-linked locus (or for bees) in problem 5. See Figure 
3.14.2. 
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Figure 3.14.2. 
Repeated 
parent- 
offspring 
mating for 
autosomal 
locus. 
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Figure 3.14.3. 
Repeated sib 
(or half-sib), 
mating in 
honeybees. 
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. Show that continued sib mating leads to the same rate of decrease in 


heterozygosity in honeybees (or for an X-linked locus) as in ordinary 
diploid inheritance. See Figure 3.14.3. 


. Artificial insemination in bees requires pooled sperm from several males. 


If these males are brothers, will this alter the results of problem 7? 


. Show that, with partial self-fertilization, whenself-fertilized plants produce 


fewer progeny than cross-fertilized the gene frequency does not change 
(although, of course, the rate of increase in homozygosity is less). 


. Is the rate of decrease in heterozygosity by random drift the same for 


a hermaphroditic species, such as earthworms, where individuals produce 
both eggs and sperm but do not self-fertilize, as for a bisexual species 
with equal numbers of males and females? 


. Show that with continued sib mating the heterozygosity, H,, is given 


by H, = H,-, — (1/8H,- 5. 


. How would you expect inbreeding to affect the rate of approach to 


linkage equilibrium? 


. Generalize the formula for maximum avoidance of inbreeding to a popula- 


tion of size 8. What is the value of 2? 


. Figure 3.14.4 gives an example of a mating system that reaches an equi- 


librium level of heterozygosity rather than approaching 0. Show that the 
eventual heterozygosity alternates. between the original amount and 2/3 
this amount. (You might enjoy generalizing this to two generations of 
sib mating between outcrosses. The inbreeding coefficient then cycles 
through the values 0, 5/14, and 3/7 at equilibrium.) 
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Figure 3.14.4. A mating system in which sib-mating is alternated with out- 
crossing. 


15. Show that, starting with a randomly mating monoecious population of 
size N, the inbreeding coefficient after t generations is | — [(2N — 1)/2N]'. 

16. What is the mean number of backcross generations required to separate 
a gene from the selected locus if the recombination frequency is 0.1? 

17. If 10 generations of backcrossing are carried out, what is the probability 
that the genes are not yet separated ? 

18. How many generations of backcrossing must an experimenter carry out 
if he wants a probability of at least 0.95 that the genes have separated? 

19. Show that the variance of the number of backcross generations that a 
gene will remain linked (see 3.8.21 for the mean number) is (1 — r)/r?. 
Hint: Note that when Z=1 cy cy? yl s, 

2 

3-4: 1+4y + 9y? + l6y> +7. 

20. Show that with maximum avoidance of inbreeding in a population of 
size 4, 


h, = th,-4 + th,-2 + Vni . 


21. Comparing this to the formula for sib mating, which is maximum 
avoidance in a population of size 2 (3.8.5), and that for self-fertilization 
(size 1), you can probably guess what the generalization is to populations 
of size 8, 16, etc. Show this for a population of size 8. (See Problem 13) 

22. Show that the variance of x in 3.8.20 is 1/t?. 


CORRELATION 
BETWEEN 
RELATIVES 
AND 
ASSORTATIVE 
MATING 








n this chapter we consider the analysis of population variance, the corre- 
lation between relatives, and the effect of inbreeding and assortative 
mating on both of these. 

Since assortative mating is the mating of individuals with similar pheno- 
types, and since the phenotype is to some extent a reflection of the genotype, 
we should expect assortative mating to have qualitatively the same conse- 
quences as inbreeding, which is the mating of genotypically similar individuals. 
We shall see that this is true, but there are interesting quantitative differences. 

A recurring problem in genetics is the separation of genetic from 
environmental effects. The first thing to do is to define the question more 
precisely, for clearly it gets us nowhere to try to answer such a question as 
“Is heredity or environment more important?” For one thing we must 
specify what trait is being discussed. We then ask questions of this type: 
Of the variability existing in a population, how much is caused by differences 
in the genetic composition and how much is caused by environmental 
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differences? In principle we could hold the genotype constant and see how 
much variation remains in the population; this remaining variation would be 
environmental. We could also hold the environment constant; the variability 
then remaining would be due to genetic differences. 

These two quantities should add up to the original variability, but only if 
we have chosen a proper measure of variability, and if the genetic and environ- 
mental factors are independent in occurrence and action. The fact that the 
variance of the sum of two independent quantities is the sum of the variances 
(see A.2.12) means that the variance is a suitable measure. We shall see in 
Chapter 5 that it is also naturally related to the way that selection acts on the 
population. We now consider ways in which the population variance may be 
subdivided into fractions assignable to various kinds of gene action and the 
environment. 


4.1 Genetic Variance with Dominance and Epistasis, 
and with Random Mating 


The variance of any trait in a population is in general determined partly by 
genetic factors, partly by environmental factors, and, when these are not 
independent, their covariance. We are now primarily interested in that part 
of the variance which is determined solely by genetic differences among the 
individuals in the population. This we call the genotypic variance, V, ; it is 
also sometimes called the total genetic variance. Later we shall bring in the 
environmental components as well. 

Consider first a single locus with two alleles, A, and A, . Suppose that the 
average phenotype or "' yield" of A,A, individuals is Y,, = a + a,,, that of 
A,A, is Yi; — à +a, and of A, A, is Y, —à + a22, wherea is the popula- 
tion mean. The phenotype may be any quantity of interest —height, weight, 
yield, fertility, or fitness, for example—measured in any convenient units. 
Ordinarily we are considering traits that are influenced by many gene loci 
so that the quantities d,,, d, ; , and a5; are small deviations introduced by gene 
substitutions at this particular locus. The value of the Y's is determined mainly 
by other loci and by the environment. We call a quantity like a,, the average 
excess of the genotype (in this case, A, A,). In this example it is also the average 
effect of the genotype. Later, in Chapter 5, we shall show that when the geno- 
types are not in random-mating proportions, there is a difference between the 
effect and the excess; but for the moment they are indistinguishable because 
we are assuming Hardy- Weinberg ratios at the locus under consideration. 

Since @ is the mean value of the phenotype, letting p, and p, be the fre- 
quencies of alleles 4, and A; , we have 


ā = pi(à + ay.) + 2p py(à + 413) + pX(à + 422), 4.1.1 
since p?, 2p, p; , and p2 add up to |. 
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We wish now to assign a value to each allele, «, for A, and a, for 42. 
These will be called the genic values. These are measures of the contribution 
of each gene, averaged over all genotypes into which this gene enters, and 
measured as a deviation from the mean. More precisely and specifically, o, 
isthe average effect of choosingan alleleat the A locus at random and replacing 
it with A,. If the allele chosen happens to be A,, then of course the replacement 
has no effect. The genic value of 4,4; , for example, will bea + a, + a. 

The genic and genotypic values (which, since we are ignoring the environ- 
ment, are the same as the phenotypic) are given below. 


GENOTYPE AA, A142 Az A2 
FREQUENCY pi 2pipz pi 
GENOTYPIC VALUE Yı: =å8 +a; Yi2 —d +a:2 Ya: —d + a22 
GENIC VALUE à + 2a, d + oi d a2 a+ 2a2 


The genic value is also sometimes called the additive value, since it is a linear 
approximation to the genotypic value. 

Soon we shall discuss the way in which the numerical values of the a’s 
may be determined. For the moment they remain as abstract symbols. First, 
we want to define the genic and genotypic variances. To do this, note that 
4.1.1 can be written 


pia; + 2pipaodia + piaj,-0 4.1.2 


which illustrates the principle that we have used several times: The sum of the 
deviations from the mean is 0. For the same reason 


piQa,) + 2p, p2(%, + &2) + p2(2a2) = 0, 
which is the same as 
pi, + P2% = O. 4.1.3 


The genotypic variance is the total variance, since there is no environmental 
effect in this case. From the definition of the variance, that is 


V, = piat: + 2pipz2 12 + pi ais. 4.1.4 
The genic variance is 
V, = pia, + 2p, pa, + az)? + piQa;) 
= (2piai + 2p,pzai) + Qpiai + 4p, pa aa; + 2p2o2) 
+ (2p\p2a3 + 2piaj) 
= 2p\ay + 2(pi%, + P2 az)? + 2p;aj. 
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But, by making use of 4.1.3, weseethat the middle term drops out, and we have 
V, = (pyar + p23). 41.5 


There is considerable variability in the literature of population genetics 
and animal breeding in the words used for these expressions. What we have 
called the genic variance (so called because it reflects the direct contribution 
of the average values of the genes) is also called the additive genetic variance 
(because it measures the additive effects of the genes). What we have called 
the genotypic variance is also called the total genetic variance. Since it is the 
total hereditary contribution to the variance we have symbolized it V, for 
mnemonic ease. We shall avoid using genetic variance when we intend a 
precise meaning, since it is used in the literature both for the genic and 
genotypic variance. 

Equations 4.1.4 and 4.1.5 are written so as to suggest immediately the 
form of the equation when it is extended to multiple alleles. In both cases 
the summation is simply continued. 

For two alleles it is for some purposes more convenient to write 4.1.5 as 


V, = 2p, p2(a, — a5)". 4.1.6 


We now are ready to determine the values of the a’s. This is done, as 
first suggested by Fisher (1918), by the method of least squares. At the moment 
this seems arbitrary, but we shall show later in this chapter that this leads 
to a natural formulation for the correlation between relatives and in Chapter 5 
that the genic variance measured in this way gives the rate at which the popula- 
tion changes under selection. 

To get the least squares estimates, we choose the a’s so as to minimize 
the quantity 


Q= pilai = 2a)? + 2p,p;(ai2 — a, — a)" + P3(a22 = 2a2)?. 4.1.7 


In other words, we minimize the weighted average of the squared deviations 
from the linear estimate, or the squared differences between the genic and 
genotypic values. 

We get the minimum estimates in the standard way by differentiating Q 
and equating to 0. This gives 

00 


2a. = —4[pi(ai, — 201)  p1p2(a15 — 81 — a5)] = 0, 
1 


ô 
e = —4[pipi(a, — % — a) + P3(a22 — 2a,)} ^ 0. 
2 


4.1.8 


If we add these two equations and recall 4.1.2 we verify the relation 4.1.3; 
the «’s have been determined so as to be measured as deviations from the 
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same mean d as the a's, as was intended in the model. Making use of this 
relation, the two equations are easily solved to yield 


&ı = p,Q,, t P2412; 
€, = p104,2 t P2422. 


4.1.9 


Notice that these values for the «’s make sense. If we write 


= Did, + PiP2 412 
Ly CE 
Di 


this says that a, is the average deviation from the mean of all the genotypes 
containing the 4, allele, weighted by the frequency of the genotype and by 
the fraction of its alleles that are A, (i.e., by one for A,A,, by one-half for 
A,43). 

Y ou might wonder why, if we can obtain the a's in such a simple way, 
we bothered with the least squares method. The reason is that this simple 
value for a is true only if the population is in Hardy-Weinberg proportions. 
When the situation is more complex, as in some examples in Chapter 5, 
the least squares approach is needed. So we have introduced the procedure 
at this time, even though it is not needed for this example. 

Returning to the problem, we substitute the values from 4.1.9. into 4.1.6 
to get an expression for the genic variance, 


Y= 2P1 Pal Py(411 — 412) + P2(a,2 — a5). 4.1.10 


Likewise, subtracting this from the genotypic variance, 4.1.4, gives the 
residual variance, which is attributable to deviations from the linear model 
caused by dominance. This is 


V, = pipi(ai; — 24,5 + a>)’. 4.1.11 


Notice that, in the absence of dominance, a,, is the average of a,, and 
a,,. Under these circumstances d,, — 2d,; + a5; — 0, and the dominance 
variance disappears, as it should. 

Our procedure has been to choose the «’s to minimize the weighted sum 
of the squares of the deviations of the phenotypic values from the additive 
value. This sum is the same as the dominance variance. We got the dominance 
variance simply by subtracting the genic variance from the genotypic variance, 
a process which contains the hidden assumption that these two quantities are 
independent and therefore additive. That this is in fact correct can be verified 
by getting an expression for the dominance variance directly and comparing 
it with 4.1.11, as follows. 

Notice, from equations 4.1.8, that 


pila ,1— 22,) = — p,py(a15 — à, — a5) = pY(a;, — 2a) = K, say; 4.1.12 
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then the dominance variance (which is Q in 4.1.7) is 
2 l | K? 


+ — | = ——. 4.1.13 
pi] pipi 


l 
V= K? E +—— 
: Pi Pip 
From equations 4.1.9. 


d,2 — X, — 95 = d45 — D,041 — P2042 — P1812 — P2422 
= —p1011 — P2422- 
Since 
pia, *2pip;aii + pid22 =0 (from 4.1.2), 


we can add this quantity without changing the value. Doing so, and re- 
arranging, gives 


42 — % — 23; = —py pray, — 2a, + a22). 4.1.14 
Substituting this into 4.1.12 and 4.1.13 gives 
Vs, = Pipi(ai — 2a, + a>2)’, 


in agreement with 4.1.11, as was to be shown. 

When the effects of the different loci are independent (that is, no epistasis) 
the covariances between the contributions of the different loci are 0 and the 
variance for all loci is the sum of the variances for the individual loci. So for 
all loci together we can write 


V= V+ Vas 4.1.15 


where these symbols now refer to the genotypic, genic, and dominance 
variances for all loci combined. 

Likewise, if the environmental and genetic effects are independent, their 
variance contributions are additive. Furthermore, just as we obtained domi- 
nance and genic variances which are independent, we can use the same 
procedures to compute an epistatic variance. We shall consider epistatic 
variance and its subdivision later in this section. Altogether, when genetic 
and environmental effects are independent 


V, =V, + Va+ V; + Ve 
= V, + Vos 


4.1.18 


where 


V, is the phenotypic (total) variance, 
V, is the genic (additive genetic) variance, 
V, is the dominance variance, 
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V; is the interaction (epistatic) variance, 
V, is the environmental variance, 
V, is the genotypic (hereditary, or total genetic) variance. 


The ratio, V,/V,, is often called the heritability. It is widely used by animal 
and plant breeders to predict the change produced by selection. 

Weshould emphasize that the formulations in 4.1.16 make the assumption 
of no genotype-environment interactions. This is one of the greatest practical 
difficulties in using these formulae. It may be necessary to introduce covariance 
terms into the expression to deal with this; in order to determine the value of 
such terms one would need to know the effect of different genotypes in differ- 
ent environments. It is particularly difficult in human genetics, of course. 
Here the effects of genotype and environment are usually confounded and 
heritability estimates are difficult at best, and often impossible to interpret. 

On the other hand, the procedure we have used guarantees that the genic 
and dominance variance components are independent. This is also true of 
the extension to epistasis. 

Notice that the statistical procedure has been to determine the a's in 
such a way as to maximize the genic contribution and minimizethe dominance 
variance. This gives a formula for predicting the progeny phenotype from 
knowledge of the parents. It also means that a low dominance variance does 
not mean that the genes are without dominance. It may simply mean that the 
particular combination of gene frequencies and dominance relations allocates 
most of the variance contribution of these genes to the additive component. 

The basis for compartmentalization of epistatic variance is an extension 
of the same principle. As much variance as possible goes into the additive 
component, as much as possible of what is left goes into dominance, and what 
is left over goes into epistasis. Kempthorne has greatly extended the epistatic 
analysis and the details are in his 1957 book. 

As an illustration of the importance of the gene frequency in determining 
the genic and dominance variance components, consider the case of complete 
dominance of the A, gene, which implies that a,, = a,5. We also assume that 
there is no environmental effect, so V, = 0 and V, = V,. In this case V,, V3, 
and V, are in the ratio 2p,p3 : p1p? : p1p*(1 + p2). The heritability, or V/V, , 
is 2p5/(1 + p2). As p; approaches 0, that is when the recessive gene is rare, 
the heritability approaches 0. On the other hand, when the dominant gene 
is rare (p; approaching 1), the heritability approaches 1. 

This accords with our intuitive knowledge from classical genetics. 
When the recessive gene is rare, selection is ineffective; when the dominant 
gene is rare, selection is just as effective as if the genes were additive. 

For a particularly lucid discussion of this material, see Falconer (1960). 
Some examples are illustrated in Figures 4.1.1—4.1.3. See also Table 4.1.1. 
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aa |= 


Figure 4.1.1. The relationship between genotypic 
variance and gene frequency, with no dominance, 
no epistasis, and with random mating. 

A — [(a. — a11)2], V=2pip2 A. 


Figure 4.1.2. Genotypic variance (V,) and genic 
variance (V,) for a locus with complete dominance 
and random mating. The abscissa gives the frequency 
of the dominant gene. The dominance variance is the 
distance between the two curves. A = [(ai, — @22)/2]?, 
0312 7 011. 
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Figure 4.1.3. Genotypic variance (V,) and genic 
variance (V,) for an overdominant locus. The values 
assumed are di; — A22 = 2a and Q12 — Qui — a. 

The abscissa is the frequency of A,. The dominance 
deviation is the area between the two curves. 


Figures 4.1.] through 4.1.3 were adapted from 

Figure 8.1 of Introduction to Quantitative Genetics, 

by D. S. Falconer; Oliver and Boyd, Edinburgh, 1960; 
Ronald Press, New York, 1960. 


Table 4.1.1. The heritability, V;/V,, with three levels of dominance and various 
gene frequencies. The environmental effect is assumed to be 0 and the effects of the 
different loci are additive. 


FREQUENCY NO DOMINANCE COMPLETE DOMINANCE OVERDOMINANCE 
OF GENE A, ai, — 2412 + Q22 = A — d12 012 > 011 — d23 

.01 1 .99 .98 

.05 1 97 .90 

10 l .95 .78 

20 1 .89 .53 

.50 1 67 .00 

.80 1 33 53 

.90 1 18 .78 

.95 1 10 90 

.99 1 02 .98 
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Heritability is also defined in other ways. Sometimes it is defined as 


Hea 

V, 
This gives the fraction of the total variance that is attributable to differences 
among the genotypes. This is useful if the question is the relative influence 
of genotype and environment in determining phenotypic differences. This is 
sometimes called heritability in the broad sense. 

On the other hand, the animal or plant breeder is less interested in this 

than he is in determining that part of the genotypic variability that is respon- 
sive to selection. This is V, , so heritability is defined as 


and is often called heritability in the narrow sense. We have used the capital 
letter for the larger value and the small letter for the smaller. In some cases 
those components of epistatic variance which cannot be easily separated from 
the genic variance, and which contribute to the correlation between parent 
and offspring, are included. 

In this book, unless the contrary is specified, we shall use the word herita- 
bility to mean V,/V,. 

With epistasis the problems become more complicated. The extension 
of the procedures we have been using to multiple interacting loci was 
developed originally by Fisher (1918) and Wright (1935), but has been ex- 
tended more recently by Cockerham (1954) and Kempthorne (1954, 1955). 

The basic procedure is, as mentioned earlier, to take up as much variance 
as possible in the additive term, then as much of the remainder as possible 
with the dominance deviation, and what is left over is attributed to epistasis. 
The epistatic terms can be broken further into those that are due to pairs of 
loci, then after removing this the remaining variance is associated with loci 
taken three or more at a time. The 3-locus epistasis can in turn be removed, 
and so on. One would expect that, unless there are very intricate interactions 
(for example, a trait that is found only when three rare genes are simulta- 
neously present), most of the variance is removed by 2-factor combinations. 

Wright (1935) showed that for one form of epistasis this is exactly true. 
This is the kind of interaction that occurs when there is selection for an 
intermediate phenotype, as when animals of intermediate size are more 
likely to survive and reproduce than those that are too small or too large. 
This introduces epistasis of a rather extreme type, since a gene that increases 
size will be favored in genotypes where most of the other genes are for small 
size but selected against when most of the others are for large size. Such 
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interaction must be quite frequent in occurrence. Wright assumed that the 
selective disadvantage is proportional to the square of the deviation from the 
optimum phenotype. With this model he showed that the total epistatic 
variance is simply the sum of all possible 2-locus components. 

For more general situations 3-factor and higher interactions are un- 
doubtedly involved, but they cannot ordinarily be measured and are probably 
usually small. In any case, we shall consider only the interactions of pairs of 
loci. Extensions to three or more are given by Kempthorne (1955, 1957). 

The relationships are clearly seen in a two-way array (see Table 4.1.2). 
The items in the body of the table are the phenotypic measurements of each 
ofthe nine genotypes, each measured as a deviation from the population mean. 
The first two subscripts refer to the A locus and the second two to the B locus. 
Thus a112 1s the amount by which the phenotype of A,A,8,B, exceeds the 
population mean. Below each of the a's is its frequency in the population. 


Table 4.1.2. Basic calculations for subdividing the genotypic variance determined 
by two independent loci with two alleles each. The frequency of alleles 4i, Az, Bi, 
and B; are Pi, pz,q1, and q2. The a's are all measured as deviations from the average 
measurement in the population. The frequency of each class is given below the 
deviation. 





Adı AiA2 A2A2 MEAN MEAN 
BiB, Q1111 01211 42211 a..11 
piqi 2pip2qi piqi gi Bı 
qı 
BiB: Qiii2 01212 2212 d..12 
2pigigz 4piPa2gíiqa 2piqiqi 2q1q2 
B: 
B: B: 01122 01222 02222 d..22 q2 
piqi 2pipaqi pigi qi 
MEAN au.. Q12.. a22.. 0 
pi 2pipa pi 1 
MEAN a a2 0 
Pi P2 1 


Z= V, + Vs -V-pigiaiui-2pnpigiaiu-: + pigiaii 
W= V, + Va = pia},.. + 2pip2ai3.. + p3 a32.. 
+ gia. 11 + 291924712 + 912.221 


U= V, = 2(piet + pa ai + giBi + q2 P3) 
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At the bottom of the table are the weighted averages of each column. 
For example 


_ Pigia + 2p1d105 01112 + P43 44122 


a 
$us piqi-- 2piqida + Piqi 





2 2 
= 4141111 + 2414241112 + 9241122 


Corresponding quantities for the B locus are on the next to the right column. 
The a's are obtained as before. Thus 


Hy = Pray, P2015, 
4.1.17 
€; = pda, t P2422..> 


and the fs are corresponding quantities for the B locus. 

Putting all this together, we have the variance components as given in 
the bottom part of Table 4.1.2. 

Table 4.1.3 gives two numerical illustrations. In both cases epistasis 
and dominance are complete. In the left the dominant genes are comple- 
mentary and in the right the dominant genes are duplicate, representing 


Table 4.1.3. Numerical examples of two contrasting directions of epistatic devia- 
tions, On the left the dominant alleles are complementary; on the right, they are 
duplicate. These lead to the classical 9:7 and 15:1 Mendelian ratios. The gene 
frequencies are adjusted so that the two phenotypes are equally frequent in both 
examples; thus the deviations are equal and have been scaled to make the total 
variance 1. 








AA Aa aa AA Aa aa 
BB 1 l -l1 BB 1 l 1 
Bb 1 1 — 1 Bb 1 | 1 
bb —1 —] —] bb 1 1 —1 
Pa =qp = .459 Da =qs =.159 
V, = .582 V, = .757 
V, = .247 Va = .072 
V, =.172 V, =.172 


V, = 1.000 V, = 1.000 
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extreme cases of diminishing and reinforcing epistasis. We have assumed that 
the two phenotypes are equally frequent. This necessitates that the dominant 
allele frequencies are 0.459 and 0.159 in the left and right tables, respectively. 

Notice that in both cases, despite the complete epistasis, only 17% of 
the total variance appears in the epistatic term. The ratio of the additive or 
genic to the dominance variance is larger in the second case. This accords with 
the results in Table 4.1.1; the dominance component decreases as the domi- 
nant gene frequency decreases. 

These examples are intended to illustrate the underlying principles. 
In practice one is dealing with quantitative traits such as size, weight, or 
fitness, and the effect of individual genes cannot be ascertained. What is 
observed is a set of cumulative effects of many genes reflected in correlations 
or covariances between relatives. From these the variance components can 
often be inferred. 

The correlations between relatives will be considered later in the chapter, 
but we need to be able to subdivide the epistatic contributions further in order 
to study the correlations. This is necessary because the different epistatic 
components contribute differently to the covariances between individuals of 
different degrees of relationship. 

Again we consider only two loci with two alleles at each locus. The 
epistasis may be broken down into interaction of the additive or genic 
components at the two loci, interaction between the additive component of 
one locus and the dominance component of the other, and interaction between 
the dominance components. We shall designate these as V 44, Vap, and Vpp. 
Thus 


V; = Vaat Vip + Vpp - 4.1.18 


The theory for such subdivision follows the principles of factorial 
experimental design (Fisher 1935) and is described by Kempthorne (1957). 
We shall not attempt a proof; but will show a simple procedure for obtain- 
ing these quantities. It is simpler to rearrange the phenotype values as in 
Table 4.1.4. Remember that the a's are deviations from the population mean. 
The quantities in parentheses are the weighted means of the two immediately 
adjacent values. For example, 


0,1.,,2 = Pr4yy142 + P200212- 


Likewise the values along the bottom are the means of the columns. For 
example, 


d, 4. = Pigi + P29141211 + P1d201112 + P242401212- 
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Table 4.1.4. Basic calculations for subdividing the 2-allele, 2-locus epistasis into 
additive x additive, additive x dominance, and dominance x dominance com- 
ponents. The gametes and their frequencies are given at the upper and left margins 
and the phenotypes in the center. Values in parentheses are the weighted means of 
the two immediately adjacent values. All values are measured as deviations from 
the population mean. 


GAMETES 
AB, Ai B2 A2B, A2 B2 
P1q2 P142 P2q1 D2da 
A1B 
d Q1111 (2111.) A1112 A1211 (ai21.) A1212 
P1ıqı 
(ai.11) (ai.12) (@2.11) (@2.12) 
A2B 
oe Q1211 (a121.) 01212 02211 (0221.) 02212 
D241 












Ai1B; 


P1q2 
= ats (41.12) (21.22) 
A2B2 


P2qQ2 


Q1112 (08112.) Q1122 A1212 (aı22.) A1222 
(a2.12) (22.22) 


01212 (2122.) 01222 A2212 (a222.) A2222 


X = 4(piqia?.1. + p1q201.2. + paqdiad.1. + paq2 01.2.) 


Y 22(piqiaiti. + pip2qiai3i. + -` + píqa asia. + paqí a722) 


Now, we can put the various formulae together. Some are from Table 
4.1.2 and the rest from 4.1.4. 


U= V,» 
W = V, + V, 
X =2V,+ Vaa,» 4.1.19 


Y 23V, + 2V; + 2V4, + Vip» 


Z=V,+ Va + Via + Vip t+ Vpp = V4. 
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From these we readily obtain 


V, - U, 

V,= —U+W, 

Van = —2U4+ X, 4.1.20 
Vıp = 3U - 2W — 2X + Y, 


The proof of these relations and extensions to multiple alleles and mul- 
tiple loci are given by Kempthorne (1954, 1955). 

An example is given in Table 4.1.5. Again, complementary dominant 
genes are assumed, but this time we have let the gene frequencies be 1/2 for 
each allele. 


Table 4.1.5. A numerical example; two loci, two alleles at each, complete domi- 
nance, complete complementary epistasis. The phenotypic values are given at the 
left, the deviations from the population mean at the right. The gene frequencies 
are all equal to 1/2. 





AA Aa aa AA Aa aa 
BB 101 101 100 BB 7/16 7/16 —9/16 
Bb 101 101 100 Bb 7/16 7/16 —9/16 
bb 100 100 100 bb — 9/16 —9/16 —9/16 
VARIANCE FRACTION OF 


COMPONENT TOTAL VARIANCE 


V, .1406 571 
V, .0703 .286 
Vaa .0156 .064 
Va .0156 .064 }.143 


y .0039 .016 
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4.2 Variance Components with Dominance and Inbreeding 


In Section 3.10 of the previous chapter we showed that the genotypic variance 
increases if there are consanguineous matings within the population. We are 
referring to the whole population, not to a subpopulation if the inbreeding 
is such as to break the population into groups. Equation 3.10.3 is, 


V, — (1— f)Vo + fV, - fü — XY, — Y, 4.2.1 


where V, and V, are the variance of a randomly mating( f = 0) and completely 
inbred population (f= 1). Yo and Y, are the corresponding means. Unless the 
two means are the same, which would be true if there were no dominance, 
the variance is a quadratic function of f. 

We now inquire as to how the genic and dominance components change 
with f. We generalize the model to include inbreeding. 


GENOTYPE A Aı A, A2 A2 Az 
FREQUENCY pit pupa f 2pipx(1 — f) pit piprf 
GENOTYPIC VALUE Yu =ã+ anı Yı2 =å + Qh Y22 — d + a22 
GENIC VALUE Āā- 2a, ā+ aitaz d+ 22i 


Following exactly the same procedure as before, we see that again 
Pi% + p2% = 0. The genic variance is 


V, = 2p pL; — 25) (1 + f). 4.2.2 


This, which corresponds to 4.1.6, would seem to imply that as the population 
is inbred the genic variance is simply multiplied by | + /—as is true for a gene 
with no dominance (see 3.10.6). But this is misleading, for the o's also change 
as the genotype frequencies change with inbreeding. What we need is an 
expression for V, in terms of quantities that do not change with f. 

Continuing, if we go through the same least squares process as before, 
we have 


Q= (pi + pipa fay, — 224)? + 2p,px(1 — faiz — a, — a2)? 
+ (p2 + pipa / (a2; — 223)". 


ð 
= —4[(pi + pıPp2fXaiı — 204) + pipy(1 — f)(ay2 — 0, — 2;)] = 0, 


oo 


De = —4[p,pr(1 — faiz — à, — a3) + (p? + PiPof (a2 — 2a2)] = 0. 
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From these equations we solve for o, and a, getting 


lf 
and 
_ pid — fay. + (P2 + p1f)a22 
e = , 4.2.4 
1l+f 
Substituting these into 4.2.2 gives 
y= 2pipo[Cpi + fP2)(@i1 — a12) + (P2 + SPi)(@12 — 222] 
7 A Papo 6L 123 


This shows that the genic variance does not change in a linear way with 
f. Notice that a,,;, a,2, and a5; are measured as deviations from a mean a. 
But @ is not constant, since it changes with f. Yet the quantities (a4, — a,;) 
and (a, — a22) do not change, since they are the same as Y,, — Y22 and Y, 
— Y,,, which are constant. 

Note that when a,, — 412 = 44; — 422 (no dominance), 4.2.5 becomes 


V, = 2p, pay — a2) (1 +f), 4.2.6 
linear in f as expected (compare 3.10.6). 

We have not given an explicit formula for V, comparable to 4.2.5 for 
V,. This can be calculated from 4.2.1. Let 

Yar — Yi2 = 1 — 2 = A, 

Yi2 — Yoo = 412 — 422 = B. 


Then 
Vo = 2pipY(P1A + p2 B) + pip2(A — B)’, 42.7 
V, = pı P(A + By, 4.2.8 
Y, — Y, = —pip2(A — B). 4.2.9 


These may be substituted directly into 4.2.1 which then gives V, for any value 
of f. The heritability is given by the ratio of 4.2.5. to 4.2.1. 

Equation 4.2.7 was obtained from 4.1.10 and 4.1.11; 4.2.8 and 4.2.9 were 
calculated from the table at the beginning of this section. 

Notice that when f = 1, the genic variance becomes the same as the total 
variance and the dominance variance disappears. This is not surprising, 
for the dominance variance depends on the extent to which the heterozygote 
departs from the mean of the homozygotes; with no heterozygotes it has no 
meaning. Heritability increases with inbreeding because, although the total 
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Vo 


(0) 0.2 0.4 0.6 0.8 1.0 
1 
Figure 4.2.1. The changes in variance components 
with inbreeding. Notice that the dominance variance, 
which is the distance between the two lines, decreases 
to 0 as f approaches 1. Complete dominance is 


assumed, with the recessive gene frequency equal to 
0.333. 


variance increases, the genic variance increases more rapidly and becomes 
a larger fraction of the total. This is illustrated in Figure 4.2.1. 


4.3 Identity Relations Between Relatives 


We introduced in Section 3.2 the coefficients of inbreeding and of con- 
sanguinity. These measure the probability that two homologous genes drawn 
at random from an individual or from each of two individuals are identical 
by descent. We wish now to extend the ideas to measure different identity 
relations among diploid individuals who may be identical for neither, for one, 
or for both of their genes. 

The method that we are using was first developed by Cotterman (1940). 
Cotterman worked only with the relationships between two individuals, 
neither of which was inbred. The extension of the method to include the re- 
lationships between two inbred individuals has been made by Denniston 
(1967). We shall consider only the simpler case where neither of the two 
individuals is inbred. 

Consider two related individuals, / and J. We define the Cotterman 
k-coefficients as: 


kg — the probability that no two genes at the locus are identical, 

2k, — the probability that one gene in / is identical to one gene in J, but 
not both, f 

k, = the probability that both genes in / are identical to those in J. 
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For example, 


GENOTYPE k-PROBABILITY 
I J 
Adı Adı ki 
A142 AiÁ3 kı 
Adı AsA2 kı 
Adı A3Aa ko 


More precisely, if a and b are the two genes in 7 and c and d are the two 
in J, as shown in Figure 4.3.1, then (using = to mean ** are identical by de- 
scent ") the k-coefficients are: 

k, = Prob[(a = c) and (b = d)] or [(a = d) and (b= c)], 


2k, = Prob[(a = c) and (b # d)] or [(a = d) and (b z c)] 
or [(b = c) and (a # d)] or [(b = d) and (a £ c)], 
ky; = Probla#c,a#d,b#c,andb#d]. 


4.3.1 


Notice that, if either or both of the two individuals J and J are inbred, 
there are other possible relations, such as a= b=c#d ora=b=c=d., 
But without inbreeding of J or J the only possibilities are those measured by 
ko, ky, and k,. 


Figure 4.3.1. A diagram to show gene-identity 
relationships. The small letters refer to the genes 
in the gametes produced by the individuals 
designated by large letters. The curved arrows 


indicate possible consanguinity farther back in 
the pedigree. 
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To compute these k-coefficients we make use of the consanguinity 
coefficients of the parents. Referring again to Figure 4.3.1, we observe that 


kı = fac Jao + fant acs 4.3.2 


where f is the coefficient of consanguinity as defined in Chapter 3, Section 3. 
This follows immediately since the four f’s are the probabilities that (a = c), 


(b = d), (a 2 d), and (b = c). 


Furthermore, 

2k, = fack! — fap) + faoll — fac) + Sacel! — fan) ^ faoll — Sac) 
= fac + fap + fac +fav — U Sac fap + Sav f sc) 4.3.3 
= 4f, — 2k}. 


We can obtain kọ by substraction since kọ + 2k, +k, — Il. 
As examples, consider the three sets of relationships shown in Figure 
4.3.2. With ordinary single cousins 


Jsc = 1/4, 
Sac = fap =f sp = 0, 
kz =0, 
2k, m Afis = 1/4, 
With double first cousins 
fac = fap = 1/4, 
Jec = fap = 0, 
k, = 1/16, 
2k, = 6/16, 
kg — 9/16. 


QO OO) SO T 
NY VV VY 


D X © 


Figure 4.3.2. Single first cousins, double first cousins, and quadruple half-first 
cousins. 
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Finally, with quadruple half-first cousins, 


Sac =Sav =S sc = fap = 1/8, 


k, — 1/32, 
2k, = 14/32, 
ko = 17/32. 


The coefficient of consanguinity fr; = (k, + k2)/2 = 1/8, the same as for 
double first cousins, as expected. 


NÁ 








Figure 4.3.3. Full sibs, and parent and offspring. 


For the k-coefficients of sibs we treat the pedigree as if two of the 
ancestors were collapsed into one, as shown in Figure 4.3.3, and write the 
f'sin terms of the gametes. Thus 


Jaz = foa = 1/2, 


faa = frc = 0, 
and 
k, = 1/4, 
2k, = 1/2, 
ko = 1/4. 


There is also a slight complication if one individual is an ancestor of the 
other, as in the parent-offspring relationship shown in Figure 4.3.3. In this 
case, we have drawn the relevant gametes; this makes the situation clear. 
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Jac — fo, = 1/2, 


Soa = Joa = 9, 
k, =0, 
2k, = 1, 
ky — 0. 


The results are as would be expected. A parent and child who are otherwise 
unrelated must share one and only one gene at a locus. 

As stated earlier, we assume that neither Z nor J is inbred. However, it is 
all right for other individuals in the pedigree to be inbred. The same rules 
apply as in the computation of the inbreeding coefficient under ordinary 
circumstances. Inbreeding is irrelevant for all members of a path except for 
the common ancestor; if the common ancestor is inbred the path is multi- 
plied by 1 + f, just as in computing the inbreeding and consanguinity coeffi- 
cients. 

Likewise the rules for X-linked traits are still good. Males in a path are 
not counted and any path with successive males is ignored. Of course the 
k-coefficients, since they apply to diploids, are meaningful only for females. 

The k coefficients are of particular use in two contexts. One is for the 
computation of the solution to such problems as: Given that Z and J are 
related and Z is genotype aa, what is the probability that J is also? Suppose 
that p is the gene frequency. The probability, then, is k, + 2k, p + kop’. 
For example if J and J are double first cousins and Z has a recessive disease of 
incidence p?, then the probability of J being affected is (1 + 6p + 9p?)/16. 

The second use is in determining the correlation between relatives when 
there is dominance. That is the subject of the next section. 


4.4 Correlation Between Relatives 


The procedure for determining the correlations between relatives, with domi- 
nance, but restricted to individuals that are not inbred is now given. As in the 
previous section it is all right for other individuals in the pedigree to be inbred, 
but not for the two individuals under consideration. The necessary calcula- 
tions are set forth in Table 4.4.1. 
The covariance of X and Y is 
Cyy = ko(pia?, + 4pipz 411015 + Apipiai; + 2p1p2011022 
+ 4pip2412022  p242;) 
+ 2k.(piaii + 2pip2didi2 
+ PıP2012 + 2pip2012022 + pla2;) 
+ k;(piai, + 2pipzaia + p2a22)- 
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Table 4.4.1. The calculation of the correlation between relatives in terms of gene 
effects, gene frequencies, and k-coefficients. The values a,, are assumed to be meas- 
ured as deviations from the mean to simplify the arithmetic. 


MM MM LLL L1 





GENOTVPES PHENOTYPIC FREQUENCY OF THIS COMBINATION 
VALUES 
X Y X Y 


DNEI——————— ————— 


Adı Ai Aı dii ai kopt + 2kipi + kapi 


AA AA au 2) 2D 2ptp; + 2kaptpi] 
Ai42 A Aı d12 aii 


A142 A1A2 Q12 aı2 ko 4pipi + 2kipipa + k22pip2 


A Aı A2 A4: ait A ko pp] 

A242 A14: 422 aii 

A142 A2 A2 a2 2 ake 2pip? + 2k n 
A2 A2 A142 a22 Qi2 [ko 2pip: 1Pip2] 
Ax,A2, Aida a22 a22 kopf + 2kip + ka pi 


IMMENSE i. t LL 


But, the coefficient of kọ is 
(pia, + 2p1p2 21; + P2422)’, 

which, by 4.1.2, is 0. The coefficient of k, is equivalent to 
2[p.(p10i1 + P2 4:2)?  Po(pitio + P2422)°], 

which, from 4.1.9, is 
2(piet + p202), 


which, by 4.1.5, is V,. Finally, the coefficient of k, is V,, from 4.1.4. 
Therefore, summing over all loci, 


Cyy 2 kV, +k: V, 
or, since V, = V, + V, in the absence of epistatic interaction, 
Cyy = (ky  kK3)V, + kz Vy. vt 
From the definition of the correlation coefficient (A.6.3) the correlation 
between two relatives, X and Y, neither of which is inbred, is 


V, V, 
r = k 3 k )2 + k —, 4.4.2 
XY ( 1 2 V, 2 V, 


Essentially the same results were derived by quite a different procedure 
by Fisher (1918). 
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As a numerical example, consider some correlations on human height. 
Fisher estimated V/V, — .74 and Vj/V, — 26. He found no appreciable 
environmental component in the population studied. From this, the correla- 
tion between parent and child, if marriages were random with respect to 
height, would be 


r = 4(.74) = .37 
and, for sibs, 
r = 1(.74) +}(.26) = .44. 


Actually, as we shall see later in this chapter, the correlations are con- 
siderably increased by the strong assortative marriage for height. 

Notice that if there is no dominance, epistasis, or environmental effect, 
then the correlation becomes simply 


ryy = xy, 4.4.3 


where f is the consanguinity coefficient. 

The correlation is the covariance divided by the geometric mean of the 
two variances. For additively acting genes (no dominance) the effect of the 
two sets of genes in the zygote is the sum of the two haploid gametic sets. 
The covariance of w + x with y + zis not influenced by the correlation between 
w and x or that between y and z. On the other hand, the genotypic variance 
does increase with such a correlation, and in fact is multiplied by 1 +J, 
where f is the inbreeding coefficient (or the correlation between alleles). 
This was shown in equation 3.10.6. Thus, if there is inbreeding, the denomin- 
ator is increased and the correlation between two inbred individuals becomes 


— NN 
JO x SO fy) 
when there is no dominance, epistasis, or environmental effect. 
Notice, by comparing with 3.3.1, that this is Wright’s coefficient of 
relationship. In fact the original derivation of Wright's measure was through 
correlation analysis, and his intention was to have the relationship coefficient 
reflect the correlation between the gene values of the two individuals. 
The extension of 4.4.1 to include epistasis is straightforward, but we shall 


simply give the results rather than the derivations. For two loci, when 
epistasis is considered, 4.4.1 becomes 


4.4.4 


Cyy = (ky K3)V, + kz Va + (ki + ka) Vua 


4.4.5 
+ (ky + ki); Van + KT Vpp. 
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The extension to more than two loci is direct. For an epistatic interaction 
between additive effects at r loci and dominance effects at s loci, the coefficient 
Is 


(ki + k;yk,. 4.4.6 


Some examples are given in Table 4.4.2. 


Table 4.4.2. Covariances between relatives of different degree in terms of variance 
components in a population mating at random. These quantities, when divided by 
the total (phenotypic) variance, V, , give the correlations. 


RELATIONSHIP COVARIANCE 
Sib IV, +4Va + 4Van+ lVao + Voo 
Parent-offspring iV, + 4Vaa 
Half-sibs, Uncle-niece, 
Parent-grandchild ly, + Vaa 
First cousins iy, HV aa 
Double first cousins XV, + 3s Va TeV aa t+ daVapo + 21Vop 


It is often of interest to ask for the covariance or correlation between the 
offspring and the average of the parents, or the mid-parent, P. Letting P,, 
stand for the measurement on the male parent, P, for that of the female 
parent, and O for that of the offspring, we obtain from the definition of the 
covariance (A.2.9) 


| E(2XP,, + P, — P, — PAO — 0) 


Crs N 
| 1X(P,—- P,(O— 0) 1X(P, — PAO- O) T 
E N 2 N 2h 
= Chos 


provided that the two sexes are equivalent. So, the covariance of offspring 
and mid-parent is the same as that between offspring and parent. 

On the other hand, the variance of P is 1/2 the variance of P„ or P, 
(if males and females are equally variable, and if they are independent as is 
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the case with random mating). The regression of offspring on the average of 
the parents is 
C, 
bop = a 


P 
= 2C,p/ Vp 4.4.8 
m (V, + Vaa) Ve- 


Such a formula can be used to predict the rate of improvement by selec- 
tion. The progeny are expected to deviate from the average by a fraction 
b,p of the amount by which the mid-parent deviates, or more formally (see 
A.4.5), 


O =P + b,;(Ê — P), 4.4.9 


where P is the population average. 
Returning to the example of Table 4.1.5, the correlations are as follows : 


Half-sibs .147 
Parent-offspring .302 
Full sibs .382 


If the epistatic factors are ignored, the half-sib correlation would be estimated 
as 1(.571) = .143, not very different. 

In practical problems, the breeder usually estimates the heritability and 
then uses this value as a guide to selection programs. His estimates usually 
come from various correlations between relatives. One of the most used 
measures of heritability is four times the half-sib correlation, particularly 
half-sibs with the same father and different mothers since this eliminates 
the confounding effects of a common uterine and early postnatal environment. 
Four times the half-sib correlation is .587. The correct prediction formula 
is bog = (V, + V44/2) V, , or .603. The error is about 2.5%! In this example the 
epistasis is quite large, since we have assumed completely complementary 
gene action. Yet it doesn't cause a very large error in heritability measure- 
ments or predictions based on these. It is these reasons, as well as the practical 
difficulty of measuring epistasis, that lead the breeder to ignore epistasis. 


Variance Within and Between Groups of Relatives If the popula- 
tion is broken up into a series of groups we can relate the variance between 
and within the groups to the correlation coefficient. It is simpler to regard 
the groups as of equal size, but the general theory does not depend on this. 
We think of the quantitative trait or measurement as made up of the sum of 
a series of additive and independent components. A particular individual, 
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the jth member of the ith group, has a measurement, y,,, which is the sum of 
an overall mean (p), a component common to all members of the group 
(b;) and an additional component (w;;) that is specific to the individual. 
Then if V, and V, are the variances of these quantities (the between-group 
and within-group variances), the correlation between members of a group is 


hk h 
= WX V, 4.4.10 
For an explanation of these relationships, see A.4.10-A.4.15. 
For example, the variance within families of full sibs is 
Vs =(1 —r)V,. 4.4.11 
From the information in Table 4.4.2, this is 
V, = 4V, + dV, + Van t $Vap t d$ Vop- 4.412 


There may also be environmental factors that are common to a sibship 
and others that differ for members of the sibship. Suppose that V, and V, 
are the environmental variance components within and between sibships; 
then the sib correlation is 


V, Ve 
r= ——. 
Y, 


4.4.13 


In animal and plant breeding experiments it is often possible to avoid 
such environmental correlations by randomization. In human genetics and 
any study of natural populations such difficulties are unavoidable. 


4.5 Comparison of Consanguineous and Assortative Mating 


Assortative mating means that mated pairs are more similar for some 
phenotypic trait than if they were chosen at random from the population. 
It may have either of two causes, or some combination of both. The tendency 
toward phenotypic similarity of mating pairs may be a direct consequence 
of genetic relationship. For example, in a subdivided populatien there will 
generally be a greater phenotypic similarity among the members of a sub- 
population because they share a common ancestry. The genetic consequences 
in this case are the same as those of inbreeding. 

On the other hand, there may be assortative mating based on similarity 
for some trait and any genetic relationship is solely a consequence of similar 
phenotypes. For example, there is a high correlation between husband and 
wife for height and intelligence, probably caused much more by nonrandom 
marriage associated with the traits themselves than by common ancestry. 
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There are also other situations. For example, there is a considerable 
correlation in arm length between husband and wife. This is probably 
a consequence of the fact that those factors, genetic and environmental, 
that increase height also increase the length of the arm. So, any assortative 
mating for height will be reflected in a similar assortative mating for arm 
length, diminished somewhat by the lack of perfect correlation between the 
two traits. 

Assortative mating is between individuals of similar phenotypes; 
inbreeding is between individuals of similar genotypes. Since individuals with 
similar phenotypes will usually be somewhat similar in their genotypes, we 
should expect assortative mating to have generally the same consequences 
as inbreeding. An excess of consanguineous matings in a population has two 
effects: (1) an increase in the average homozygosity and (2) an increase in the 
total population variance. Assortative mating would be expected to produce 
the same general kinds of results. 

In general, assortative mating causes less increase in homozygosity than 
inbreeding, especially if the trait is determined by several gene loci. On the 
other hand, assortative mating causes a large increase in the variance of a 
multifactorial trait, in contrast to that produced by a comparable amount of 
inbreeding. A further difference is that inbreeding affects all segregating loci, 
whereas assortative mating affects only those related to the trait involved. 

Pure assortative mating, like inbreeding, does not change the gene 
frequencies. We shall refer in this book to any situation where different 
genotypes make different contributions to the next generation through 
differential survival, mating patterns, or fertility, as selection. Only when all 
genotypes make the same average contribution will we regard it as pure 
inbreeding or assortative mating. But we shall see later in the chapter that with 
many assortative-mating systems, and even more so with disassortative- 
mating systems, there are differential contributions of different genotypes; 
so the effects of assortment are often confounded with selection. 

The variance-enhancing effect of assortative mating is apparent with a 
simple example. Suppose that an arbitrary quantitative trait is influenced 
by two loci without dominance. Let each gene with subscript 1 add one unit 
to the phenotype, whereas each gene with subscript 0 adds nothing. Then the 
genotype 4,4, B,B, represents one extreme phenotype and Ao Ao Bo B, the 
other, with 4,4; Bo By, 4140 B, By, and Ag Ao B; B, beingexactly intermediate. 
Inbreeding will increase the frequency of all four homozygous genotypes, 
A,A; B1B,, Ag Ao B4B,, 414; Bo Bo, and Ao Ao Bo Bo. This will increase the 
variance; in fact, it will exactly double the variance if the population is changed 
from random-mating proportions to complete homozygosity. 

On the other hand, with complete assortative mating, the population 
approaches a state where only the extreme homozygotes, 4,4,B,B, and 


CORRELATION BETWEEN RELATIVES AND ASSORTATIVE MATING 143 


Ao Ao Bo Bo, remain. This clearly causes a much greater enhancement of 
the variance, especially if the number of relevant loci is large. The 
variance increase with assortative mating has been shown experimentally 
in Nicotiana (Breese, 1956) and Drosophila (McBride and Robertson, 1963). 
Thelatter authors also found the expected decrease with disassortative mating 
and demonstrated that the rate of change under selection can be increased 
with assortative mating. 

With inbreeding there is no systematic change in the frequencies of the 
gamete types A,B,, A,;Bo, Ao B,, and Ag Bo. On the other hand, as the 
example shows, assortative mating causes a change in frequency of the gametic 
types, increasing two while decreasing the other two. So, another way of 
describing the effect of assortative mating and of understanding its vari- 
ance-enhancing effect is to note that it causes gametic phase (or linkage) 
disequilibrium. 

The simplest cases of assortative mating were worked out long ago. 
These involved mainly a single locus (Jennings, 1916; Wentworth and Remick, 
1916). One example of this work is the simple case of two alleles where each 
genotype mates strictly assortatively, that is, only with another individual 
of the same genotype. The genetic consequences are obviously exactly the 
same as with self-fertilization. Heterozygosity is reduced by half each genera- 
tion and the variance is eventually doubled. 

It might be thought from this example that assortative mating leads 
eventually to complete homozygosity, as do many forms of inbreeding, but 
this is not the case. Partial assortative mating, like partial self-fertilization, 
leads to an equilibrium level of heterozygosity other than zero. 

In the more general treatment of assortative mating two cases are of 
interest. At one extreme the individuals fall into two (or possibly more) 
discrete phenotypes with preference for mating within a phenotype. For ex- 
ample, deaf persons tend to marry others with the same trait. At the other 
extreme is a character like size, for which there is a correlation between mates, 
but the distribution of sizes is continuous and determined by multiple genetic 
and environmental factors. 

Before dealing with more complex multifactorial models, we shall first 
consider a single-locus trait. 


4.6 Assortative Mating for a Single Locus 


With inbreeding the choice of a mathematical model is clear from knowledge 
of the relationships and from the Mendelian mechanism. With assortative 
mating the choice is not so obvious, as a different behavior pattern can 
produce a different consequence. 
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We shall measure the degree of assortative mating by the product- 
moment correlation between the parents, r. For a quantitative trait the cor- 
relation coefficient is directly measurable. For qualitative traits we measure 
thecorrelation coefficient as the decrease in the proportion of matings between 
dissimilar phenotypes, divided by that proportion which is expected with 
random pairs. We consider two situations. 


Each Genotype with a Different Phenotype Assume thateach geno- 
type is distinct, the differences being determined by a series of alleles. No 
restriction is placed on the number of alleles. 

Assume that in each genotype a fraction r select mates of their own 
genotype while the remainder mate at random. In this system, perfect assorta- 
tive mating is equivalent to self-fertilization, so this model of imperfect 
assortative mating is formally equivalent to partial self-fertilization. This was 
considered in Section 3.8. From 3.8.19 the heterozygosity at equilibrium is 
given by 


] — 
H, = an r). 461 





2-r 


This result was first obtained by Wright (1921). Notice that complete 
homozygosity is not approached unless the assortative mating is complete 
(r 2 1). Otherwise, the population approaches a level of homozygosity which 
is equivalent to an inbreeding coefficient of f = r/(2 — r). 

We showed in Section 3.10 that when there is no dominance the variance 
is proportional to 1 f. Therefore, with partial assortative mating the 
population variance at equilibrium is 





Vo = Vo(1 f)- vlz d 2! 4.6.2 


where V, is the variance with random mating. 

For example, if r = 1/2, the equilibrium heterozygosity is reduced by 
1/3 and the variance is increased by the same fraction. Only if r = | does the 
population become homozygous, in which case the variance is eventually 
doubled. 

A more important example, especially in human genetics, is the case 
where dominance is complete. We now consider this. 


Complete Dominance We assume that there are only two alleles and, 
since dominance is complete, there are only two phenotypes. If there are more 
than two alleles it may be that they can be grouped into two sets as regards 
mating pattern. For example, it might be that the normal allele is dominant 
and that the wild types tend to mate among themselves leaving all the mutant 
types to mate with other mutants. 
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Let P, be the frequency of AA in generation t, 2Q, be the frequency of 
heterozygous Aa, and R, that of the homozygous recessive aa. Let r be the 
correlation between mating individuals; that is to say, a fraction r mate 
strictly assortatively and the rest mate at random with respect to the trait 
considered. 

To see the algebraic relationships we imagine the population as being 
divided into three groups: a randomly mating group comprising a fraction 
(I — r) of all matings and with the A gene frequency P + Q = p; a recessive 
assortatively mating group making up a fraction rR of matings and with the 
A gene frequency 0; a dominant assortative group comprising a fraction 
r(1 — R) and with the A gene frequency (P + Q)/(1 — R) or p/(1 — R) and 
recessive gene (a) frequency Q/(I — R). From the randomly mated group the 
fraction of AA, Aa, and aa progeny will be p°, 2pq, and q?, where q = 1 — p. 
The contribution of the dominant assortative group to the AA class next 
generation will be 


r(l — R)Ep/(! — R)I’, 
to the Aa class will be 

r(| — R2[p/( — R)ILE/C — R)], 
and to the aa class will be 

r(1 — REQ — RY}. 


The recessive assortative-mating group will make its entire contribution, rR, 
to the aa class. 
Putting all this together the genotype frequencies next generation will be 





P(AA) = Pry = (1 = Dp ra = R)[ : | 











1—R, 
4.6.3 
=(1= npe LE 
2 r)p 1—R,’ 
P(Aa) = 20,4, = (1 — r)2p4 + (0 - RA —— L 
i UTI—R,I—R, 
4.6.4 
2rpQ, 
—-2(1-r + ; 
( )P4 "PX 
3 Q, V 
P(aa) = R,,, = (1 —r)q* + rR, + r(1 — R) I-R 
d 4.6.5 


q? + R(p— 2 


pM 
—(1-r)q er 1— R, 
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We have written p and q with no subscripts, since they do not change 
with time. This can be verified by summing 4.6.3 and half of 4.6.4. Recalling 
that p 2 P + Q and q = Q + R, this simplifies to p,,, — p,, showing that 
the gene frequency does not change. As with inbreeding only the genotype 
frequencies change, not the gene frequencies. 

When assortative mating is complete (r — 1), 4.6.4 becomes 


2pQ 
20, +17 - : 
p 





This approaches 0 as t increases, but extremely slowly. When p = 1/2, then 
20, = 1/2, and the frequency of heterozygotes in successive generations 
follows the simple harmonic series, 1/2, 1/3, 1/4, 1/5, ..., as first shown by 
Jennings (1916). 

With any value of r except 1 the population never attains complete 
homozygosity but approaches an equilibrium. We can find the equilibrium 
heterozygosity by equating Q,,, to Q,, giving 


Ô? + p(l —r)O — pq —r) =0 4.6.6 


whose solution gives the equilibrium value, Q, in terms of the correlation 
between mates and the gene frequency. 

Alternatively, we can do as we did before and express the heterozygosity 
as a function of the inbreeding coefficient, f. Replacing Ô by pq(1 — f) in 
4.6.6 gives 


af? c (r—1—q-—rq)f* rq — 0. 4.6.7 


When the correlation, r, is equal to 1, there is complete homozygosity (f = 1). 
Otherwise, there is equilibrium at an intermediate value of the inbreeding 
coefficient, given by the solution of the above equation lying between 0 and 1. 
Notice that, contrary to the results with inbreeding, the equilibrium value of 
the inbreeding coefficient with assortative mating is a function of the gene 
frequencies. 

We note here that simply equating the frequency of heterozygotes in two 
successive generations does not prove that this equilibrium is actually 
approached, or that itis stable. That both of these are in fact true can be shown 
but the biological considerations make it quite clear that there must be a 
stable equilibrium, so we shall not demonstrate it more rigorously. 

Of considerable interest is the extent to which assortative mating increases 
the frequency of homozygotes for recessive genes. The equilibrium proportion 
of recessive homozygotes is given by equating R,,, and R,. This gives the 
quadratic 


R? — R(1 +g? — rp”) +g? =0 4.6.8 
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with the solution 


1-gq^-rp?— Ja + q? — rp’)? — 4q? 
ee 





R= 
Some numerical examples are given in Table 4.6.1. 


Table 4.6.1. Proportion of recessive homozygotes with assortative mating, for 
various values of the recessive-allelle frequency (g) and the degree of assortative 
mating (r). The values given are the proportion of recessive homozygotes after one 
generation of assortative mating and at equilibrium. 


RECESSIVE-ALLELE FREQUENCY, 4 





Several general conclusions emerge from examination of this table. 
First, with weak assortative mating there is little ultimate increase in homo- 
zygous-recessive genotypes, as seen in the values near the top of the table. 
However, the population goes a large fraction of the way to equilibrium in the 
first generation. On the other hand, as seen in the lower left part of the table, 
intensive assortative mating with a rare recessive gene can lead eventually to 
a considerable increase in recessive homozygotes, but this is approached very 
slowly. 

Notice that when r = 1, the solution to 4.6.9 is R = q. As with inbreeding, 
the proportion of recessive homozygotes approaches the gene frequency. 
This is expected, of course, since there has been no change in gene frequency 
during the process. On the other hand, the rate at which the genotypes change, 
and the change in effective inbreeding coefficient, /, depend on the gene fre- 
quency. 

Assortative mating is quite high for deafness and it might be thought 
that this is a major factor in increasing the incidence. It has been estimated 
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(Chung, Robison, and Morton, 1959) that there are at least 35 recessive genes, 
any one of which can cause deaf ness when homozygous, and with an average 
frequency of 0.002. Whatever the amount of assortative mating for deafness 
as a trait, it would be only about 1/35 of this amount for any one recessive 
gene—somewhat less because of other causes of deafness. Thus, even with 
strict assortative mating the incidence would not be increased by more than 
2% or 3%. However, the tendency might be enhanced if there were a tendency 
for consanguineous marriages among the deaf. 


4.7 Assortative Mating for a Simple Multifactorial Trait 


There is strong assortative marriage for height and intelligence in the human 
population. These traits are determined by a large number of genes and are 
also influenced by the environment. We should expect that, if there are several 
genes acting somewhat cumulatively to produce the trait, assortative mating 
for the entire trait would have a very small effect on any one locus. On the 
other hand, we would expect that there would be an enhancement of the 
variability, morethan with inbreeding. 

The enhancement of variability can be seen by a simple example. Suppose 
that the trait depends on two pairs of factors, such that each substitution 
of an allele with a subscript 1 for an allele with subscript O adds one unit to 
the phenotype, as follows: 


PHENOTYPE ON 


GENOTYPE ARBITRARY SCALE 
1 AsAi BiB, Y+4 
2 A141B,Bo, A1Ao BiB, Y+ 3 
3 A:i41Bo Bo, Ao Ao Bi Bı, As: Ao Bi Bo Y+2 
4 A:Ao Bo Bo, AoA o Bi Bo Y+1 
5 Ao Ao Bo Bo Y 


Inbreeding will increase all four homozygotes. On the other hand, 
assortative mating will increase only the two extreme types, 1 and 5. This is 
easily seen to be true for complete assortative mating, for the extreme types 
can produce progeny only like themselves. Therefore the occurrence of an 
extreme type is an irreversible process; or, in a different vocabulary, types 1 
and 5 represent absorbing barriers. 

Assume that A, and B, have the same frequency, p and g = 1 — p. Then 
the frequencies will be as follows: 
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EQUILIBRIUM FREQUENCY 


COMPLETE 
CODED RANDOM COMPLETE ASSORTATIVE 

TYPE PHENOTYPE MATING INBREEDING MATING 

1 2 p* p? P 

2 1 4p°q 0 0 

3 0 6p7q? 2pq 0 

4 —1 4pq* 0 0 

5 —2 q* q? q 


The mean phenotype, Y, is 2(p — q). The three variances are 


Random: V = p*(4) + 4p'q(1) + 4pg* (1) + 4*(4) — Y? = 4p9; 
Inbred: V = p*(4) + q7(4) — Y? = 8pq; 
Assortative: V = p(4) + q(4) — Y? = 16pq. 


The inbred variance is a confirmation of the principle given in Chapter 3 
(3.10.6) that without dominance or epistasis the variance when f = | is 
twice the variance when f = 0. This is true regardless of the number of factors 
involved in the trait. On the other hand, with assortative mating the variability 
increase depends on the number of factors. 

The procedure can be extended to any number of loci. This was first 
done by Wright (19215). We have modified his method somewhat, but follow 
thesame general idea. The procedure comes from Felsenstein (see Crow and 
Felsenstein, 1968). 

Consider a trait determined by n gene loci. At each locus is a gene with 
frequency p such that the substitution of this gene for its allele adds a constant 
amount a to the character under consideration. Later, this restriction to equal 
gene effects and equal frequencies at all loci will be removed. 

Let 


n — the (haploid) number of relevant gene loci, 

f= correlation in value of homologous genes, 

k — correlation of nonhomologous genes in the same gamete, 

l = correlation of nonhomologues in different gametes, 

m= correlation of homologues in different individuals, X and Y, 
m' — correlation of nonhomologues in different invididuals. 


These relations are shown in Figure 4.7.1. 
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Figure 4.7.1. Correlations between the values of genes in two 
parents, X and Y, and their progeny. The circles represent 
individual genes. Homologous genes are opposite each other and 
genes from the same gamete are in a single vertical column. 
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An individual gene has a variance pqa?, where q — 1 — p. This can be 
shown as follows: For convenience, let the value of one allele be « and the 
other 0, with frequencies p and q. The mean value is px + q0 = pa. The vari- 
ance, v, is p(a — pa)? + q(0 — px)? = pqx?. Likewise the covariance, cov , of 
two genes, each with the same variance, is the variance times the correlation 
coefficient. For example, the covariance of two homologous genes is pqa?f. 

We can write the variance of the total value of individual X as the sum 
of the variances of the component genes. Thus 


V(X) = Lo; + 2X cov;,, 4.7.1 


where i and j designate individual genes. 
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The variance of an individual gene, v;, is pqa? and there are 2n of them, 
so Lv, = 2npqa?. The covariance of a pair of alleles is pg fa?, and there are n 
pairs. The covariance between nonalleles from the same gamete is pgka? 
and there are n(n — 1) combinations. Likewise, there are n(n — 1) pairs of 
nonalleles in different gametes with covariance pg/a?. Putting all this together, 
we find that the variance of X at time t is 


V(X), = 2npqc? + 2npgf &* + 2n(n — l)pgk,a? + 2n(n — Vpgl, o? 


4.7.2 
= 2npqa? [1 +f, + (n — Vk, + 1)]. 
Likewise the covariance of X and Y is 
C(X, Y), = 4npqm, a? + 4n(n — 1)pqm;o?. 4.7.3 


If the assortative mating is based solely on the phenotype, rather than 
being a by-product of common ancestry of the mates, and the gene frequencies 
are the same for all loci, there is no more reason for alleles in mates to be 
alike than nonalleles. Therefore m, = m, and we can drop the prime in equa- 
tion 4.7.3, leading to 


C(X, Y), = 4n?pqm, a. 4.7.4 


From Figure 4.7.1 the following recurrence relations can be seen. 


faim, 4.7.5 
la,—mo-m, 4.7.6 
kai =(1—ce)k c, 4.7.7 


where c is the proportion of recombination between the two loci concerned. 
In this case it is an average of the recombination between all pairs of loci 
concerned with the character, and for man is very nearly 1/2, since most pairs 
of loci are unlinked. 

If r is the coefficient of correlation between the phenotypes of the two 
mates, X and Y, which have the same variance, the covariance is 


C(X, Y) 2 rV(X). 4.7.8 
Substituting into this from 4.7.2 and 4.7.3 gives 

4n^pqm,a? — r[1 +f, + (n — Yy(k, + L)]2npqa?. 4.9 
Now we substitute f,,, for m, (see 4.7.5) and f, for l, (4.7.5 and 4.7.6), which 


leads after some rearrangement to 


fa => [1 + nf, 4 (n — DA]. 4710 
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Using this and the relation 
ka= (1 o, + cf, 4.7.11 


(obtained from 4.7.6, 4.7.7, 4.7.8), we can compute f, for any generation f, 
given the starting values, f; and kọ, which would both be 0 for a randomly 
mating population in gametic phase or “linkage” equilibrium. 

At equilibrium there is no distinction between ¢ and ¢+ l, so using 
carets to designate equilibrium values, 

f=l=mek. 4.7.12 


Using these equilibrium relations 4.7.10 becomes 


Ec 4 nf t (n — 0f], 4.7.13 
2n 
leading to 
f = ————— 4.7.14 
2n(l—r) +r 


as first shown by Wright (1921). 


If nis large, f is small unless r is very nearly 1. This shows that, unless 
the number of loci is small or the degree of assortative mating is very intense, 
there is only a very slight increase in homozygosity. 


There is a much larger effect on the variance. From 4.7.2, substituting 
f, for |, from 4.7.5 and 4.7.6, 


V(X), = Vo[l + nf, + (n — Ik, 4.7.15 


where V, = 2npqa?, the variance with random mating and linkage equilibrium. 
At equilibrium under assortative mating, substituting into 4.7.15 from 4.7.14 
and 4.7.12, 


Vo 


1 4.7.16 
1- (1 — z) 
2n 


(Wright, 1921), or, for large n, 


P(X) = 


V 
V(Xx)= TEP approximately 4.7.17 
-=r 


As a numerical example, let r = 1/4, which is roughly the correlation in 
height between husbands and wives. The homozygosity is increased only 
trivially if n, the number of factors, is large. After one generation f = 1/8n and 
at equilibrium is 1/(6n + 1) or approximately 1/6n. On the other hand, the 
variance is increased by 1/8 in the first generation and eventually by 1/3. 
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4.8 Multiple Alleles, 
Unequal Gene Effects, 
and Unequal Gene Frequencies 


Still assuming no dominance and epistasis and no environmental effects, 
the assumption of only two alleles with equal effect and equal frequency 
will be dropped. 

Let a? be the variance of a gene at the ith locus. Thus o? = Ep, ak — M3, 
where p, and a, are the frequency and effect on the trait of the kth allele, M; is 
the mean effect of these alleles and the summation is over all alleles at the 
ith locus. The øs remain constant under assortative mating since the gene 
frequencies do not change. 

The covariance o,, between two genes is 0,6, r,, where r,, is the correla- 
tion between the two genes. The correlations f, k, |, m, and m’ of Figure 
4.7.1. are no longer constant for all pairs of genes. Equations 4.7.2 and 4.7.4 
can be written more generally as 


V(X) =2) 0} +2) a7 fh 22, kijo apne le 0,0j, 4.8.1 
i21 i i*j 
C (X, Y)=4) mj, 0, 0;. 4.8.2 
ij 


The recurrence relations 4.7.5, 4.7.6, and 4.7.7 still apply to individual 
gene pairs; that is, 


fina = Mii t» 4.8.3 
lj, ei T Mij, 4.8.4 
kij, ii l- Cu), pt Cijlij is 4.8.5 


so that at equilibrium 
J, = my, kiy Sly = mp 4.8.6 
Then at equilibrium 4.8.1 becomes 
P(X) 4 mio,7; - 2) mici +29,0; . 4.8.7 
ij i 7 


We now let 


_ ÈM; mij016j 4.8.8 


E Yumuol Mi? 
Substituting this into 4.8.2 and 4.8.7 gives 


P(X) = C(X, pss Vo 4.8.9 


n, 
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where the carets indicate equilibrium values and 
Vo = V(X), 2 2Y, o;. 4.8.10 
i 


Vo is V(X) before the assortative mating began (f=/=k=m=0 in 
4.8.1). Since C(X, Y) = rV(X), we get 


f(x)- d: 4.8.11 
1-r 


(1 — 12n,) 


At equilibrium the average inbreeding coefficient, weighted by the 
contribution of each locus to the variance, is 


24 mi o? 


f = CL, 4.8.12 


Substituting from 4.8.8, 4.8.2 and 4.8.10, 


- C(X,Y) 1 
e 2n V, 





o 


V(X) 


f 
= — 4.8.13 
2n, Vo 





E r 
~ 2nd —r)+r 


Comparing 4.8.11 and 4.8.13 with 4.7.16 and 4.7.14 shows the equivalence 
of n and n,. When m; = mj; = m, then from 4.8.8 


t 


and if each locus has the same standard deviation (c; = e, = a) then n, = 
n?c? na? = n. We therefore call n, the effective number of loci. It will be equal 
to the true number when there is free recombination and all loci contribute 
equally to the variance; otherwise it will be less. 

Notice that when n, — l, 4.8.13 gives 





r 
[52 4.8.15 


the value mentioned earlier when we discussed a single locus. Likewise 





P(X) = "(i : -). 4.8.16 
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The variance after one generation of assortative mating is readily derived. 
From equations 4.8.1, 4.8.2, 4.8.3, 4.8.4, and 4.8.10 we can write 


V(X), = Vo + 2 Y oF Mii, o + 2Y kij, 19,9; + 2); 0,9; ij o * 4.8.17 
i i*j ij 
But k;; , = 0, from 4.8.5. Thus 
V(X), = Vo +2}, mi;o;0; 
i, j 
= W + ł4C(X, Y)», 


from 4.8.2, and therefore 


2 
since C(X, Y) = rV(X). 
Table 4.8.1 gives numerical illustrations of the increase in homozygosity 


and variance after one generation of assortative mating and after equilibrium 
is reached. 


VX), = v1 * z), 4.8.18 


Table 4.8.1. Effect of assortative mating on the average inbreeding coefficient, f, 
of relevant genes and the variance of the trait, V. Subscripts 0, 1, and œ refer to the 
randomly mating population, the population after one generation of assortative 
mating, and at equilibrium under assortative mating. Other symbols are: ne = 
effective number of gene loci, r = correlation between mates, H = heritability. 


V, V. 

^s h fo Vo Vo 

-— 1 500 1,000 1.500 2.00 

H=1 4 125 1,000 1.500 8.00 

TEC 1 250 333 1.250 1.33 

H-1 4 063 A11 1.250 1.77 

d 0 0 1.250 2.00 

r& 325 1 125 143 1.125 1.14 

Hed d 031 040 1.125 1.28 

a 0 0 1.125 1.33 

r=.5 e 0 0 1.063* 1.21 
H=5 


* Exact only if V, — 0. 
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We have not considered the effects of disassortative mating, but there is 
nothing about these formulae that demands that r be positive. Disassortative 
mating has opposite effects, a decrease of homozygosity and variance and 
a building up of linkage disequilibrium in the opposite direction (i.e., an 
association in the same gamete of genes of opposite effect). 


4.9 Effect of Dominance and Environment 
In a randomly mating population the variance can be divided into components 
V, Js V, + V, qe V, 4.9.1 
or 
V, = V+ Ve; 4.9.2 


where V, is the total variance and V,, V,, Vp, and V, are the genic (additive 
genetic), dominance, genotypic (or total genetic), and environmental 
components. 

The equations above assume that the genetic and environmental factors 
are independent so that V, is simply additive to the other components. This 
is a major limitation to precise quantitative prediction of the phenotypic 
effects of assortative mating, particularly in human populations. We are also 
ignoring the effects of epistasis. Finally, all the results from here on are only 
approximate. 

According to Fisher (1918), assortative mating will increase V, , but not 
V, and V, . This is not surprising, since with multiple factors only genic effects 
contribute to the correlation between parent and offspring (Reeve, 1961). 
However, it is not strictly true, for V, does change. But, as noted earlier, with 
a large number of genes there is very little change in heterozygosity under 
assortative mating, and therefore V, is not expected to change very much. 

We let A be the correlation between the genic values of the mates. Thus, 

A= r= rH, 4.9.3 

V, 
where H is the heritability. This is the same as A? of Section 4.1. 
After one generation of assortative mating, we have approximately 


A 
V, = (1 + 2) +V,+ Vi 4.9.4 


from 4.8.18 after replacing r with A. Equation 4.8.18 is reasonably accurate 
if V, is small. Otherwise the factor by which V, is inflated may be appreciably 
in error. See Reeve (1961) for an exact expression for the 2-allele case. Equa- 
tion 4.8.18 is strictly correct only in absence of dominance. 
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At equilibrium under assortative mating 


uL ee 
A=r = ==rH. 4.9.5 


P, 


Substituting into 4.9.1 from 4.8.11 gives 


1 
952wxw[———lI-c-V-cV 
t 1 — AQ d e 


^ 


V, + "rs = vhi + s: 29) 4.9.6 


l 
where Q = | — 5 . 
Ne 








We have used A instead of r because only the genic part of the correlation 
contributes significantly to the variance of future populations. Furthermore, 
we use the equilibrium value of A, since even with constant r there will be 
changes in A as the composition of the population changes. 

The object is to express the population variance and the correlation be- 
tween relatives after equilibrium under assortative mating,in terms of quanti- 
ties that can be measured in the random-mating population before assortative 
mating began. To do this we must have a measure for A. Note first the identity 


1—H 
V-V-V, a 4.9.7 


which follows from the definition, H = V,/V,. But, since V, and V, do not 
change much with assortative mating, 


V,- V,- f, (5) = "(r—35) Ta 4.9.8 


Equating the right sides of 4.9.7 and 4.9.8 and recalling that H = A/r, we 
obtain after some algebraic rearrangement Fisher's equation for A, 


Q0 — H)? — A+ Hr — 0. 4.9.9 


H can be measured in the randomly mating population. Then, if Q is taken 
as | (i.e., the effective number of genes involved is assumed to be large, as it 
must be if other assumptions are to be correct), the equation can be solved 
for A, and this value put into 4.9.6 to give the equilibrium variance. 
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As an example, let H = .5, r 2.5, and Q=1. Solving for A gives 
(2 —,/2)/2 = .293, Then, from 4.9.6, 


.293 


so the population variance is increased after equilibrium under assortative 
mating of this degree by about 21 %. 

This value is given in the bottom row of Table 4.8.1, along with the 
increase in variance after one generation of assortative mating. As noted above 
the latter value especially may be a poor approximation if V, is large. 


4.10 Effect of Assortative Mating on the 
Correlation Between Relatives 


This was first done by Fisher (1918) and we follow his method. 

Consider first parent-offspring correlation. The correlation is Vj/2V, 
in a randomly mating population. With equilibrium under assortative 
mating this will change for two reasons. One is that the variances increase, 
so we must replace V, and V, with their equilibrium values. The other reason 
is that the correlation between the two parents will, to the extent that this is 
reflected in genetic differences, add to the correlation of offspring with one 
parent through influences acting through the other. 

If the chosen parent deviates by a unit amount from the population 
average, the other parent will deviate by r because of the correlation between 
the two mates. The mean deviation of the parents is thus (1 + r)/2, and the 
expected deviation of the children is the genic part of this, or DIP, times the 
parental mean deviation. Thus, the correlation between the chosen parent 
and the offspring at equilibrium under assortative mating is 


1? 
ê ae 8 1 .10. 
po TA +r), 4.10.1 


which in terms of the random-mating variances is 
1V,-V,R 

po = — 9. 98. (1 + 
2 V+ V, É 


as given by Fisher (1918). 
Fisher also gives the grandparent-child correlation as 


r), R = 1 T 4.10.2 





1+ri+A 


fpo = .10. 
5 5 4.10.3 


po 





TJ 
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and each additional descendant multiplies the correlation by (1 + A)/2, as 
expected, since only the genic component is transmitted and therefore A 
replaces r. 

With full sibs the problem is more complicated because there are also 
correlations between the dominance components. Recall first the correlation 
between sibs under random mating, which is 


Ls + LE 10.4 
ud. ae 4.10. 
o0 2 V, 4 : 
The variance within a sibship with parents chosen at random is 
V = AG = Too) 
1V, IY, 
=Vj1—-~-2-- z) 10. 
2y ay, 4.10.5 


=4V,+ iV V. 


However, Fisher notes that this is also a good approximation to the variance 
within a sibship when the parents are mated assortatively, since the variance 
within a sibship depends only on genes for which the parents are heterozygous 
and, as we have learned, with a large number of genes the heterozygosity is 
only slightly decreased by assortative mating. Considering now the popula- 
tion at equilibrium under assortative mating, the correlation is a measure of 
the reduction of the variance within a sibship. 
Thus 


JM tava, 
00 [4 E 


But, from (4.9.6), P, = V, + KV,. Making this substitution and rearranging 
we obtain 


1-? 


_ VK + 4) + 4M, 


^ 





o 77 5 ; 4.10.6 
V, + KV, 
where 
K= AQ ; 4.10.7 
1 — AQ 


A may be obtained from 4.9.9. Q is taken as l. 
Correlations for other relatives are given in Table 4.10.1. 
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Table 4.10.1. Correlations between relatives in a randomly mating population and 
in a population at equilibrium under assortative mating where r is the phenotypic 
correlation between mates, H = V,/V,, D=Va/V,, A = Hr. Equilibrium values 
under continued assortative mating are indicated by carets. The effective number of 
genes is assumed to be large, so that (2r. — 1)/2n. may be regarded as 1. 


Parent-offspring 4H 1A +r) 
Grandparent-offspring 4H 4A(1 + r\(1 + A) 
Great grandparent-offspring i LH +r) + A)? 
Sibs IH --iD 1H + ÁÀ)- 1b 
Double first cousins IH -FRdgD — iH c 3) sb 
Uncle-niece +H +A(1 + A)? 4+14DA 


First cousins iH(-- A)? + i5 bÂ 


Fisher applied these methods to data on human stature. The data 
(obtained from earlier studies by Pearson and Lee) show 


r = .2804, 
Fpo = -5066, 
Foo = 5433. 


From 4.10.1 we calculate the equilibrium heritability 


V, 
=£ = fA =.79! 
f , 
from which 
A = Ar = 222, 


assuming Q = I. 
Assuming the observed correlations represent equilibrium values we 
can ask what the heritability was before assortative mating began. 


V, = V1 — A), 
m Á 
"oda or 
V 
H = < = 24. 
V. 


So the assortative mating has increased the heritability from .74 to .79. 
From the sib correlation 4.10.6 we can estimate V,/V,, which turns out to be 
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about the same as | — A. Hence, on the basis of these data, Fisher con- 
cluded that environment is of very little importance in determining variance 
in human stature. 

The analysis of variance in a population at equilibrium under assortative 
mating would be 


V, 6296 
Va 21% 
V, 83% 
Effect of 

assortative mating 17% 
V, 100% 


Fisher assumed that the environmental similarity between sibs was no 
greater than that between parent and offspring. This seems quite dubious; 
it is probable that genes for height are less dominant than he thought and the 
environmental influence greater. 

To make it easier to go from this treatment to Fisher’s 1918 paper, here 
is a list of equivalents: 


V; T? P h G 
V, c? A A 
V, £g? Q l 

V/V, € r u 


4.11 Other Models of Assortative Mating 


The correlation model that we have been discussing may not always be 
realistic. [t is simple and natural, but of course there is an infinity of possible 
patterns of assortative-mating behavior. For quantitative traits the complexity 
may be such that it is not feasible to study more realistic models, except per- 
haps as special cases by computer simulation. Fortunately many traits of 
interest are normally distributed, or approximately normally, or may be 
transformed to be so, and the linear correlation and regression model works 
very well for most purposes. 

On the other hand, there has been considerable discussion in the literature 
of specific models of assortative mating for single-locus traits (O'Donald, 
1960a; Scudo, 1968; Parsons, 1962; Watterson, 1959; Scudo and Karlin, 
1969; Karlin and Scudo, 1969). 

As one considers the complexities of real populations there are many 
factors to take into account. The pattern may depend on whether the mating 
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is monogamous or promiscuous, on the sex ratio, on which sex exerts the 
preference, on the nature of the inheritance of the trait, and on many other 
variables. Another complication is that the mating pattern may lead to a 
greater fertility of some genotypes than others. In other words, there may be 
selection in addition to pure assortative mating. 

As stated in the beginning of this chapter, we shall ordinarily use the 
words assortative mating to describe pure assortative mating with no selec- 
tion; that 1s, each genotype has the same expectation of surviving and fertility. 
When this is not so we shall speak of assortative mating with selection. 

Even with this definition there will be difficulties in interpretation. For 
example, it may be that the same genotypes are more fertile in some mating 
combinations than others. In some instances it may be more convenient to 
designate the fertility of a mating combination than that of a genotype 
(see Bodmer, 1965). 

We shall consider only a few of the many examples that could be used, 
first uncomplicated by selection and later with selection included. 


Assortative-mating Models Without Selection We return first to 
the single locus with dominance, first discussed in Section 4.6. We assumed that 
the same level of preference existed among the recessive phenotypes as among 
the dominants, both measured by the correlation coefficient, r. But we would 
now like to be more general. For example, red-haired persons (or some 
red-haired persons) may prefer to marry others with red hair, but the rest of 
the population may have different preferences, or be indifferent to hair color. 
Consider the same model as before, but let the degree of assortment be r and 
r' among the recessives and dominants, instead of the same value for both. 
Again we assume that, after the designated fraction of assortative pairs is 
formed, the rest of the population mate at random. 
The matings will then occur in the following ratios: 


MATING FREQUENCY 
ASSORTATIVE RANDOM TOTAL 
A— x A— r'(1 — R) (1L — r' (1  R?/D IR 
A— xaa (1 — rYX1— r)Y1— R)RID 
aa x d— (I — rYX1—r)1— R)R'D R 
aa x aa rR (1 —r R?/D 
r' — Rír— r!) ] —r'— R(r — r^) l 


—1—2D =D 
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The equations corresponding to 4.6.3-4.6.5 can be obtained from this table, 
giving 


Warp? rp 











Pia = Ul. 
1 D TR 4.11.1 

21 — r)p[d — r)Q. - (0 — r)R]  2r’pQ, 
2Ü ipe ————É MÁS : 11. 
Qi p TET 4.11.2 

[1 — 7)Q. - (0 —r)R,]? r’'O? 

Re. = —— m l rR + 3 41. 
1 D em 4.11.3 


That the gene frequency does not change can be verified by adding 4.11.1 
and half of 4.11.2 (or 4.11.3 and half of 4.11.2). This again shows that pure 
assortative mating does not change the gene frequency. 

However, the final equilibrium and the rate of approach to this depend 
on r and r', as well as on the gene frequencies. The equilibrium value for Q 
is given by a cubic equation (see Scudo and Karlin, 1969), which of course 
reduces to 4.6.6 when r =r’. 

As an example, suppose a certain fraction of deaf persons attend common 
schools and tend therefore to marry assortatively. The rest, say, are educated 
in public schools and join the population presumed to be marrying at random 
with respect to this trait. Then r' will be approximately 0. 

In this case, the equilibrium equation corresponding to 4.6.8 and obtained 
by equating R,,, = R, =R is 


rR? — (1 —r + 2gr)R +g? =0, 4.11.4 
with the solution 


s lerk 2r = JU — r 4 qr)? — 4rq? 


R m 4.11.5 





When r is less than 1 there is an equilibrium set of genotype frequencies. 
The proportion of recessive homozygotes is somewhat less than if r = r', and 
the heterozygosity somewhat greater. When the mating preference is complete 
(r = 1), the population eventually becomes homozygous. Notice that in this 
case, 4.11.4 and 4.6.8 are equivalent, as they should be. If there is complete 
assortment within one phenotype, there must be within the other also. 


Assortative Mating with Selection Tocontinue with the same general 
model, suppose that those matings which are assortative differ in fertility 
from those which are random. This might happen, for example, if the 
assortment took place first; then the later random matings might have their 
fertility impaired by the delay. 
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As an example, suppose that in equations 4.11.1—4.11.3 the random 
matings have their fertility multiplied by a constant C, which may be greater 
or less than l. This is equivalent to replacing D by D/C, which we shall call 
K. However, the equality signs must now become proportionality signs, 
since the three equations no longer add up to unity. 

Consider first that r = r’. Then we can write, after some algebraic 
simplification (it will be helpful to recall that P, + Q, = p, and Q, + R, = 
q, = be pi), 

Pe Pray + Qu (l—r)*p+rpK p 


= = SS L, 4.11.6 
g, Q,+1 + Risi (1 m r)'4 + rqK q 


regardless of the value of K. 

In this case there is still no selection for, although different kinds of 
matings take place with different frequency, each genotype makes the same 
contribution of genes to the next generation. However, if r Ær’, then the 
relationship 4.11.6 is no longer true. The gene frequencies change and 
assortative mating is complicated by an inherent selection in the process. 

An interesting model that has been used for assortative mating with 
selection is the following. Suppose that matings occur at random but that 
disassortative matings are less fertile. This might happen if for some reason 
matings between unlike types were incompatible. We measure the extent of 
reduction in matings between different phenotypes by s. The model is specified 
in this way: 


PROGENY RATIOS 


MATING FREQUENCY RATIO 
AA Aa aa 
AA x AA P? P? 
AA x Aa 4PQ 2PQ 2PQ 
Aa x Aa 4Q? Q? 2Q? Q? 
AA X aa 2PR(1-s) 2PR(l-s) 
Aa x aa 4QR(I-s) 2Q R(1-s) 2QR(1-s) 
aa x aa R? R? 
Total 1 -2R(|L — R)s = D 
Collecting the progeny of each genotype, 

DP,,, =(P, + Q) =p, 4.11.7 
DQ,+ı = Q(P + Q,) + RP, + QI — s)=p.q,—p,R,5, 4.11.8 


DR, xx (Q, + Ry = 20, R, S= a; em Xy, E R,)R,5 . 4.11.9 
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Adding 4.11.7 and 4.11.8 gives 


Prat = Put Qua = nl es 4.11.10 

When R,« 1/2, p,41 > pi, and the dominant gene increases in frequency. 
Whether the quantity in brackets is greater or less than | determines whether 
the gene frequency increases or decreases. So there is a tendency for the popula- 
tion to move away from the point where the two phenotypes are equal 
(P + 20 = R = 1/2). The population tends to move toward fixation of which- 
ever phenotype was more common in the first place. We end up eventually 
with a homozygous population, and which type it is depends mainly on the 
initial gene frequencies. 


1 








Figure 4.11.1. The paths followed by 
populations under assortative mating of the 
type described by equations 4.11.7-4.11.9. 
There is an unstable equilibrium at the 

point 4, — 4, R = 1/2. Points corresponding to 
the Hardy-Weinberg proportions lie along 
the parabola, whose equation is 

P? —2PR+ RP? —2P — 2R + 1 — 0. When 

s is very small, the population tends to move 
quickly toward this curve and then proceed 
slowly along the curve to one or the other of 
the extreme points, P = 1l, or R = 1, depend- 
ing on which side of the horizonta! line, 

R = 1/2, it started from. 
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Actually the situation is quite complex. There is a point of unstable equi- 
librium when R = 1/2 and the equilibrium value of the gene frequency q is 
obtained by solving 4.11.9 when R = 1/2 and q, =q. Sometimes the gene 
frequencies will reach this equilibrium, but this point is unstable and the 
slightest displacement starts the process off toward fixation at one of the two 
extremes of fixation of the dominant or the recessive allele. 

A discussion of this case has been given by Scudo and Karlin (1969). 
We have simply sketched the general picture in Figure 4.11.1. This shows the 
behavior when s is small. 


4.12 Disassortative-mating and Self-sterility Systems 


There has been much less attention paid to disassortative- than to assortative- 
mating systems, for the very good reason that with the great majority of 
traits the mating system, if departures from randomness occur, is more likely 
to be assortative than disassortative. There are, however, some conspicuous 
exceptions—one is the ordinary system of biparental reproduction, which may 
be regarded as an example of disassortative mating. 

Even more than with assortative mating, disassortative tends to be con- 
founded with selection. It is typically accompanied by gene frequency changes. 
In fact, it may be impossible to have strongly disassortative mating without 
selection. From the simple example of a population with 90% of one pheno- 
type and 10% of the other, we can see that there is an obvious upper limit 
to the number of disassortative pairs that can occur. The more common 
type tends to get left out. 

We shall consider two examples, both of which involve a mixture of 
disassortative mating and selection. 


Self-sterility Alleles in Plants A number of plant species have this 
kind of system. The rule is that pollen is functional only on a plant, neither 
of whose two alleles at this locus is the same as that of the pollen. The system 
we are considering depends entirely on the haploid genotype of the pollen 
itself and not on the plant that produced it, although there are examples where 
the determination is by the genotype of the plant rather than the individual 
pollen. 

The self-sterility allele system obviously prevents self-fertilization since 
neither of the two types of pollen produced by a plant can function on that 
plant. It is also clear that the system would work best in preventing self- 
fertilization, while permitting cross-fertilization, if there were a large number 
of alleles, so that a randomly chosen pollen is not likely to share its gene with 
the plant on which it lands. It is conventional to designate the alleles at this 
locus with the letter S, with individual alleles indicated by subscripts. 
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As an example, S, pollen would function on S, S,, Sa S4, or Sj, Ssg 
plants, but not on S,S; or S,S,;. An immediate consequence of the system is 
that every plant is heterozygous. Thus, the total frequency of all genotypes 
carrying one S; allele is 2p;. 

We shall make two assumptions, both reasonable. One is that pollination 
is random. The second is that enough pollen is produced that every ovule 
has an equal chance of being fertilized. When there is no selection between 
different ovules, then each allele transmitted through the female has an equal 
chance. 

The success of a pollen grain of genotype S; depends on its falling on a 
plant not carrying this allele, and thus is proportional to | — 2p;. Thus the 
proportion of S; pollen among all successful pollen will be p;(1 — 2p;)/ 
Yp,(— 2p,). But the denominator is Xp; — 2Xp? = | — 2Xp2, since the gene 
frequencies must add up to 1. Thus, dropping subscripts for simplicity of 
notation, the change in allele frequency due to pollen selection is in one 
generation 


p—2p ^ 2p(p- X) 

122x P 1-2’ 
where 

X = Lp’. 


But, there is no selection on ovules; all the selection is through the pollen. 
Thus, since the genes contributed to the next generation come equally from 
the male and female parents, the total change in gene frequency is only half 
as great. So, we write 


| —P(p— X) 


Ap = po X =p’. 4.12.1 


This formulation is an excellent approximation, but it is not exact. It 
fails to take into account the exact nature of the other pollen grains with 
which any particular grain is competing on a particular stigma. For example, 
if most of the other grains carry one of the two alleles that the female has, 
this particular pollen has a better chance of being the successful one. However, 
if the number of alleles is large, most of the competing pollen will also fail 
to have an allele corresponding to that of the stigma on which they are com- 
peting, so that this factor can be ignored. 

The more exact formulation has been given by Fisher (1958). It is also 
discussed by Moran (1962) and a rather similar approach was made by 
Wright (1939). The exact expression is in fact quite intractible, and the solu- 
tions obtained have been by approximations such as the one we have just 
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given. Our formulation gives the correct equilibrium value, and the correct 
rate of approach unless the number of alleles is small or their frequencies 
grossly unequal. 

It is clear from 4.12.1 that the frequency of an allele will increase if it is 
less than X, and will decrease if it is greater. The same is true of all alleles. 
Hence each allele frequency approaches the same value, X, and there is a 
stable equilibrium when all alleles are of the same frequency. If there are n 
alleles, then the equilibrium frequency of any one is 


p; ln. 4.12.2 


Notice that equation 4.12.1 is equal to 0 when p; = 1/n for any number of 
alleles. That is, if one or more alleles have frequency O the rest again approach 
equality. Such a system will lead to the maximum numberof alleles maintained. 
for every new mutant will tend to increase and, aside from loss due to sam- 
pling accidents, will be incorporated into the population, whereupon a new 
equilibrium is approached, with p; = 1/(n + 1). 

It is no surprise then that actual investigations have revealed a very large 
number of such alleles persisting in plant populations. The only force tending 
to reduce the number of alleles (other than some limitation on the total range 
of mutational possibility) is the accidental loss of alleles from random changes. 
This problem has been discussed in great detail by Wright (1938, 1964, 1965), 
Fisher (1958), and others. We shall consider such random processes in the 
last three chapters. 


Disassortative Mating; One Locus, Two Alleles We shall consider 
the problem briefly, and only for two alleles. The problem of disassortative 
mating for more than two alleles is quite complex (see Finney, 1952; Moran, 
1962). 

Imagine first a very simple case where the only matings are between 
AA x aa and Aa x aa. It is obvious that, after the first generation, there will 
be no more AA homozygotes and the only remaining matings are Aa x aa. 
This produces two kinds of progeny, like the parents, and in equal propor- 
tions. The equilibrium is immediately stable. 

This situation is found in some plants where the dominant gene causes 
short style and the homozygous recessive is long. Fertilization normally 
occurs only between two different types. This is clearly expected to lead to a 
stable 1:1 polymorphism. A more familiar example is the ordinary sex- 
determining system in which all matings are X X by XY, again leading to a 
stable 1:1 sex-polymorphism. 

Consider next a slightly more complicated example, and one with a rather 
interesting consequence. This time we still consider only two alleles, but each 
genotype is regarded as different. The rule is that each genotype can mate 
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with any genotype but its own; otherwise mating is random. If, as usual, 
we let P, 20 = H, and R stand for the frequencies of the three genotypes AA, 
Aa, and aa, we can set forth the various possible matings as follows: 


PROGENY RATIO 


FREQUENCY 
MATING RATIO AA Aa aa 
AA x Aa 2PH PH PH 
AA X aa 2PR 2PR 
Aa x aa 2HR HR HR 
Total D=1 — P? — H?— R? 


The recurrence relations are easily written 


Pa = P,H,/D,, 


4.12.3 
Ry at = R,H,/D,, 
from which 
Paw odes TT 
Ris 1 R, )J a 
showing that the ratio of the two homozygotes does not change. 
On the other hand, 
Ry 1 P, +1 H, 
Im uuu NP alid 4.12.5 





and 
D, = 1 — P? — H? — R?. 


The homozygotes increase when H > D and decrease when H < D. There is 
a stationary state when H = D, for then the genotypes have no tendency 
to change frequency. Setting H = D, and dropping subscripts since this is an 
equilibrium, gives the ellipse, 


2P? + 2PR+2R?—3P-3R+1=0. 4.12.6 


This is shown in Figure 4.12.1. As can be seen, the equilibrium is a rather 
peculiar one. Any point along the ellipse is stable with respect to perturbations 
changing the frequency of heterozygotes, for the population tends to return 
to the points along the ellipse. On the other hand, there is no tendency to 
return to the original point if there is a change (chance or otherwise) along 
the ellipse. Hence there are an infinity of points that are equilibria of this sort. 
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Figure 4.12.1. The case of complete 
disassortative mating with two alleles 
and three genotypes. The population 
follows the paths indicated by the 
arrows. Points along the ellipse represent 
equilibria. The arrows cross the ellipse, 
since the approach to equilibrium is 
oscillatory. 


The only possible values lie within the triangle. The arrows indicate that 
there is no tendency for the P/R ratio to change. An actual population would 
drift randomly along the curve until one or the other of the two homozygotes 
is lost. Then the situation would reduce to the 2-phenotype polymorphism 
of the type discussed before, AA and Aa in equal proportions, or Aa and aa. 
These represent the two points at the end of the curve. 

Another point of interest about this sytsem is that the approach to the 
ellipse is oscillatory. The population moves in the direction of the arrows 
in Figure 4.12.1, but overshoots each generation so that the value moves 
back and forth along the arrow, crossing the ellipse each time, and with 
decreasing amplitude until the equilibrium is reached. 

In plants where there is an incompatibility system where the pollen 
function depends on the genotype of the plant that produces the pollen 
rather than the specific allele in the pollen grain itself, there are two possible 
mechanisms whose consequences differ slightly. It may be that incompatible 
pollen fails to fertilize the ovile (pollen elimination); alternatively, the fertil- 
ization may occur, but this particular ovule then fails to develop if the mating 
is incompatible (zygote elimination). The model we have just discussed is 
equivalent to zygote elimination. For a discussion, see Finney (1952) and 
Moran (1962). 
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4.13 Problems 


l. 


ll. 


12. 


Compute the mean and the genic, dominance, and genotypic variances at 
inbreeding coefficients 0, 1/2, and 1 for the following four examples. 


Yi Yi: Y22 P, P; 
(a) 99 100 101 
(b) 99 100 100 
(c) 100 100 99 
(d) 99 100 99 


GA v o'S 
nA — — O 


. For the system, 44,55 412 = 455 (i.e., A, completely recessive), what 


gene frequency maximizes the genic variance? The dominance variance? 
The total variance? (Assume random mating.) 


. Give two reasons why the correlation between mother and daughter is 


likely to be lower for human weight than the correlation between sisters. 


. In terms of the model at the beginning of Section 3.10, show that the genic 


variance and dominancevariance for f= 0 are 2p, p;[A + D(p, — p]? 
and (2p, p; DY. 


. What are the k-coefficients for a child and grandparent, for half-sibs, for 


uncle and niece, and for individuals D and H in pedigree 3.4.2? 


. Show that 


2ko = (1 — fael — 2fnp) + (1 — 2/45X 1 — fac) — 2Cf4c fon + fac San): 


. Show that the dominance variance with random mating is the square of 


the difference in the population means at f = 0 and f= 1. 


. Relatives such that k, — 0 are sometimes called unilineal (Cotterman, 


1941) and those with k, greater than 0 are bilineal. Give an example of a 
bilineal relationship other than identical twins, sibs, and double first 
cousins. 


. Show that for additive genes (no dominance, no epistasis, no environment 


effect) the correlation between parent and offspring is 


1+2f,+f, 
2/0 + fr + fy) 


where f, and f, are the inbreeding coefficients of the offspring and parent. 


. In deriving 4.4.4 we assumed that the covariance of the sums of two quan- 


tities is not changed if the quantities are correlated. Prove this. 

Show from 4.11.9 that the equilibrium value of q for R = 1/2 is 

(s + /Q — 31 —s))/2. 

What is the limit of d in Problem 11 as s approaches 0? Show that the 
genotypes at this point are in Hardy-Weinberg ratios. 


172 


13. 


AN INTRODUCTION TO POPULATION GENETICS THEORY 


What is the heritability of a trait determined by a very rare recessive gene? 
What is the correlation between sibs ? 


. Show that with “ pure" overdominance (a, ; > d,, = @22) the heritability 


is (p, — P2)?/(pi + 3). 


. Zand J are first cousins. J has phenylketonuria, the allele frequency being 


(say) 0.01. What is the probability that J has this recessive disease? 
Is the answer given by using k-coefficients exact ? 


. Show that with three self-sterility alleles the proportion, P,, of S,S, 


heterozygotes in generation ¢ is (1 — P, ,)/2. What is the equilibrium 
proportion? Is the approach direct or oscillatory ? 


. Assuming the model of Table 4.4.2, what is the covariance of second 


cousins? Of individuals D and H in Pedigree 3.4.2? 


. Compare R from 4.11.5 for r — r' 20.5 with r=0.5, r' 20 when 


q = 0.01. Do they differ appreciably ? 


. Suppose that in a randomly mating population the heritability, H (= /?) 


of IQ is 0.6 and that the correlation between husband and wife is 0.5. 
By what fraction will the variance be increased when the population 
reaches equilibrium under this degree of assortative marriage? Compare 
this with the amount when the heritability is 1. If, prior to the beginning 
of assortative marriage, the IQ distribution had a mean of 100 and a 
standard deviation of 15, what fraction of the population would have 
IQ's above 130 before and after? 


SELECTION 


election occurs when one genotype leaves a different number of progeny 

than another. This may happen because of differences in survival, in 

mating, or in fertility. We are, as before, ignoring for the present 
differences that arise from random fluctuations. As mentioned in the pre- 
ceding chapter, selection is distinguished from inbreeding and pure assorta- 
tive mating in that under the latter systems the number of descendants is 
the same for all genotypes. Selection, along with migration and mutation, may 
alter the gene frequencies. However, it does not necessarily do so; it may be 
that the fitnesses of the different genotypes differ, but in such a way that 
opposing tendencies balance and the gene frequency is unchanged. Selection 
may be because of the greater fitness of some types, as in nature, or through 
artificial selection as practiced by the animal and plant breeders. 

Sewall Wright (1931) has said: 


Selection, whether in mortality, mating or fecundity, applies to the organism 
as a whole and thus to the effects of the entire gene system rather than to single 
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genes. A gene which is more favorable than its allelomorph in one combination 
may be less favorable in another. Even in the case of cumulative effects, there 
is generally an optimum grade of development of the character and a given 
plus gene will be favorably selected in combinations below the optimum but 
selected against in combinations above the optimum. Again the greater the 
number of unfixed genes in a population, the smaller must be the average 
effectiveness of selection for each one of them. The more intense the selection 
in one respect, the less effective it can be in others. The selection coefficient 
for a gene is thus in general a function of the entire system of gene frequencies. 
As a first approximation, relating to a given population at a given moment, 
one may, however, assume a constant net selection coefficient for each gene. 


Selection involving both mortality and fertility is almost always com- 
plicated. One consequence of differential mortality is that a population 
counted at any stage except as zygotes will usually depart from Hardy- 
Weinberg ratios, even when mating is random. This would suggest that the 
proper time to census a population would be as soon as possible after fertili- 
zation. On the other hand, from the standpoint of assessing the effects of 
random gene frequency drift, it is more meaningful to count adults at the 
beginning of the reproductive period (Wright, 1931; Fisher, 1939a). When 
the probability of mating or the fertility is being considered, it may be more 
meaningful to measure the fertility of mating pairs than of individuals 
(Bodmer, 1965). 

The systematic quantitative theory of natural selection came of age with 
a series of papers by Haldane (1924-1931). In the beginning of the first 
paper he said: 


A satisfactory theory of natural selection must be quantitative. In order to 
establish the view that natural selection is capable of accounting for the known 
facts of evolution we must show not only that it can cause a species to change, 
but that it can cause it to change at a rate which will account for present and 
past transmutations. In any case we must specify: 

(1) The mode of inheritance of the character considered, 

(2) The system of breeding in the group of organisms studied, 

(3) The intensity of selection, 

(4) Its incidence (e.g. on both sexes or only one), and 

(5) The rate at which the proportion of organisms showing the character 

increases or diminishes. 
It should then be possible to obtain an equation connecting (3) and (5). 


Starting with the simplest cases—a single pair of alleles, random mating, 
discrete generations, constant selection coefficients equal in the two sexes— 
he proceeded to more and more complex cases. These included non-Mendelian 
inheritance, different intensities in the two sexes, within-family selection, 
X-linkage, inbreeding and assortative mating, multiple factors, linkage, poly- 
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ploidy, sex-limited characters, reversal of dominance in the two sexes, gametic 
selection in one or both sexes, multiple-recessive and multiple-dominant 
traits, and overlapping generations. These early studies are summarized in 
his 1932 book The Causes of Evolution. The ways that selection can operate 
are uncountable, and many special cases are of genetic interest. However, 
we shall discuss only a few in order to illustrate general principles. 

The effects of selection were also studied by R. A. Fisher and Sewall 
Wright, with a greater emphasis on quantitative traits. We shall also discuss 
the main generalities arising from these studies. 

Natural selection, like classical mechanics, has both static and dynamic 
aspects. The statics of evolution will be dealt with in Chapter 6. This involves 
the relatively stable situation that results from the balance of various opposing 
forces—mutation, selection, migration, and random fluctuations. In this 
chapter we consider the dynamics—the way in which selection changes the 
composition of a population. 

We shall consider two models, models 1 and 2 of Chapter 1. The first 
assumes that generations are discrete and nonoverlapping, as in annual plants. 
This is also applicable to many problems in animal breeding where pedigrees 
are known, and the generations can therefore be kept straight. 

The second model applies strictly to organisms that reproduce and die 
continuously and with a constant probability of both, a situation approxi- 
mated by some single-celled organisms. However, we should like to use it as 
an approximation to the situation in many organisms where generations 
overlap. The approximation is best when the population has reached stability 
of age distribution, as discussed in Chapter 1. 

As expected, the two models become quite similar when the intensity of 
selection for the trait under consideration is small. For populations not in 
age-distribution equilibrium, it may be useful to weight each individual by 
its reproductive value (see Section 1.5). However, most of the time we shall 
use the equations in a simple form, regarding them as useful approximations 
from which we can reach interesting qualitative and semiquantitative general- 
izations. 


5.1 Discrete Generations: Complete Selection 


As a first example, we consider simple cases in which one class is lethal or 
sterile. In animal or plant breeding this corresponds to culling one class 
completely. 


1. Selection Against a Dominant Allele If there is complete selection 
against a dominant factor, that is, if this phenotype is eliminated or fails to 
reproduce, the population next generation will be composed entirely of 
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homozygous recessives (except for new mutations, incomplete penetrance, 
and such complications). So, one generation of selection is sufficient to elimin- 
ate the undesired type from the population. 


2. Selection Against a Recessive Allele On the other hand, if all 
homozygous recessives are eliminated there are still recessive factors re- 
maining in the population, hidden by heterozygosis with their dominant 
alleles. These can combine in later generations to produce zygotes that are 
homozygous for the recessive gene. 

If the proportion of recessive genes in generation O is pọ and mating is 
at random, there will be på recessive homozygotes. When these are eliminated, 
the only possibility for a homozygous-recessive offspring is by the mating 
of two heterozygotes, in which case 1/4 of the offspring are expected to be 
homozygous recessives. Among the dominant phenotypes, a fraction 
2po(1 — poYy[(1 — po)? + 2po(1 — po)] will be heterozygous. The proportion 
of recessive homozygotes next generation will then be 


Al 2po(l — po) I Do J 
4 L(1 — po)’ + 2po(1 — po) 1+ pol - 


In the next generation, we can replace p, by pg/(l + po), and so on, 
leading to the following formulae for the change in gene frequency. 

















p= Po 
1+ Do’ 
Pi Po 
= = —, 5.1.1 
HET uem Ie pe 
p= Po 
E Lti’ 


and the zygotic frequencies are given by the squares of these quantities. 
Another form of expression that might be mnemonically preferable is 
obtained by replacing p with I/y. This leads to 


l l l 


y dti t+Yo 





5.1.2 


The proportion of recessive homozygotes is the square of these quantities. 

For example, if the present frequency of a rare recessive disease is 
1/40,000 or 1/2007, and if none of these reproduce, the number next generation 
will decrease to 1/201? or 1/40,301. This provides a numerical illustration of 
what is already well known—-that selection against a recessive that is rare 
goes very slowly. 
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3. Selection Against a Sex-linked Recessive Trait If there is com- 
plete elimination of homozygous-recessive females and hemizygous-recessive 
males, then after the first generation there will be no more homozygous- 
recessive females. From that time on, all the recessive phenotypes will be 
males, who in turn came from heterozygous females. A heterozygous female 
may transmit the recessive gene to a son or to a daughter. In the former case 
it is eliminated ; in the latter it is retained in the population. The proportion 
of heterozygotes among the daughters of affected females is 1/2. Since the 
affected males come from heterozygous females, they too are reduced by 
half each generation, the proportion of affected among males in any genera- 
tion being exactly half the proportion of heterozygotes among females in the 
previous generation. 

The results that we have just discussed are shown graphically in Figure 
5.].1. Note in particular the very slow rate of decrease of homozygotes for 
autosomal recessives, once the gene has become rare. 

Figure 5.1.2 shows actual data on the decrease in frequency of a recessive 
lethal gene in Drosophila. In this experiment the generations were kept 
separated so the conditions of the model are fulfilled in this regard. In each 
generation the adults will be of two genotypes, AA and Aa, the aa type having 
died in the pre-adult stages. The two surviving types were classified by progeny 


0.25 


0.125 


Phenotype Frequency 





Time in Generations 


Figure 5.1.1. Selection against a recessive (R), dominant (D), and an 
X-chromosomal recessive ( X). Selection is assumed to be complete, 
and in each case the starting frequency of the trait is 0.25. In the 
X-chromosome case it is assumed that there are no homozygous- 
recessive females, as would be the case after one generation of 
selection. 


178 AN INTRODUCTION TO POPULATION GENETICS THEORY 


0.6 


0.5 


0.4 


9 0.3 


02 


0.1 





5 
t 
6 
t 


4 


Figure 5.1.2. Selection against a recessive lethal gene. The allele 
frequency is given on the ordinate. In the abscissa f is the generation 
when adults are counted and z’ is the generation for zygote gene 
frequency. Data from Wallace (Amer. Natur. 97: 65-66, 1963). 


tests and the proportion of a genes among these is shown in the graph for 
each generation. 

In generation O the adults were all Aa, since the experiment was started 
with flies all of the same heterozygous genotype. Thus p, = 1/2. This is also 
the expected proportion of a alleles among the zygotes in the next generation. 
At the adult stage the proportion of a alleles will have changed to 1/3, 
according to the relations in 5.1.1. Inthe figure the generations corresponding 
to the adult gene frequencies are indicated by t; those for zygote gene fre- 
quencies are indicated by t’. 

The data agree approximately with the theoretical expectations, the 
standard errors of the points being large enough that most deviations are 
not significant. However, when all generations are considered there is some 
selection against the heterozygotes. 


5.2 Discrete Generations: Partial Selection 


We define the fitness, or selective value, as the expected number of progeny 
per parent. The parents and progeny must be counted at the same age, of 
course. The effects of differential mortality and fertility are kept within the 
same generation if each generation is enumerated at the zygote stage. In a 
biparental population, half of the progeny are credited to each parent. 
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Asexual Population Consider that there are many genetic types in the 
population, and that each reproduces its own type exactly. The model is 
also appropriate for a Y-chromosome factor (if only males are considered), 
for an X-chromosomal factor in either sex in an attached- X stock, or cyto- 
plasmic inheritance transmitted through one sex. 

Assume that the genotypes A,, 45, A45, ... have fitnesses w,, W3, w3, ... 
and are present in the population in frequencies p,, P2, P3, .... Then the 
proportion of A; genotypes next generation will be 


PiWi . PiWi 


Pi T piw, + pwm + w 


where w = }_ p; w; is the average fitness. 
The change in the proportion of 4; in one generation is 


,W; (Ww; — Ww 
s (5 AWW ) 5.2.1 


w w 


Api 


The quantity w; — W is the average excess in fitness of the genotype A;. For 
only two alleles, this formula is conveniently written as 


Apa Papal — wj) " SPiP2 -— 


Ww 


where s, the selection coefficient, is w, — w, and p, + p; = 1. 

Notice that selection is most rapid when the two types are nearly equal 
in frequency and becomes slower when one is much more common than the 
other. For example, if w, is 1.1 and w; is .9, the frequency of A, will increase 
from .50 to .55 in one generation, but if the frequency is 0.10, it will only 
change to .1195 in one generation. 


Diploid Sexual Population We now let w;; stand for the average fitness 
of the genotype 4; Aj. As before we let P;; be the frequency of the homozygous 
genotype 4; A; and 2P;; the frequency of the heterozygote 4; Aj. Then the 
frequency of the gene 4; is (from equation 2.1.2) 


Pi =}, Pig. 
j 


Next generation the proportion of A; genes will be 


w=) } Py wy = L piwi 5.2.4 
5 J 
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and 
. P.. w.. 
w = Des Py my 5.2.5 
Di 
Hence, 
dw. — Ww 
Ap; = Bins»). 5.2.6 
w 


The formula is the same as that for asexual selection, but w; now has a 
more complex meaning. In this equation w; is the average fitness of the A; 
allele; more specifically, it is the average fitness of all genotypes containing 
A,;, weighted by the frequency of the genotype and by the number of 4; 
alleles (1 or 2). Then w, the average fitness of the population, can be expressed 
in either of two ways: (1) the average of all the genotypes in the population, 
and (2) the average fitness of all the genes at this locus. These correspond to 
the two expressions given in 5.2.4. 

Equation 5.2.6 shows that the rate of change of the gene frequency is 
proportional to: 


(1) The gene frequency, p;. Thus a very rare gene will change slowly, 
regardless of how strongly it is selected. 

(2) The average excess in fitness of the A; allele over the population 
average. If the excess, w; — Ww, is positive, the allele will increase; if 
negative, it will decrease. If this is large the frequency of A; will 
change rapidly; if small, slowly. 


Notice also that the gene frequency change will be slow when the allele 
becomes very common (p, > 1). In this case, w; and w are not very different, 
since most of the population contains the A; gene. This point can be brought 
out by rewriting 5.2.6 in another way: 


Ap, — pi — Pw — wy) 


i , 5.2.7 
w 


where w, is the average fitness of all alleles other than A;. This shows clearly 
that Ap; approaches 0 as p; gets near to either O or |. 
If the population is in random-mating proportions we can write 5.2.7 


in still another way, often used by Wright (e.g., 1949). With random mating 
Pi; = p, p; and 


ee y 34 Di Pj Wi; = Y pw 


"In 5.2.8 
1i? 
Pi J 
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or, representing all alleles that are not A; as collectively A, with frequency 
Px = | — Pi: 


Ly Pep ig La PiWi 


J#i 


Yi Ya Py Pa Wj : ; 
Moe j,k zi 
Ww; S PiWi + Px Wixs 


Wy = Pi Wyi + Px Wyx ’ 


5.2.9 


where wj, = n; is the average fitness of all heterozygotes where one allele is 
A, and w, is the average of all genotypes where neither is A;. In this notation, 
still assuming random-mating proportions, 


W= Divi t 2p(1 — p)wi, + (1 — p) Wax + 5.2.10 


In this formulation w;, and w,, are not constants, but depend on the relative 
frequencies of the non-A, alleles (except when there are only two alleles). 
Notice that the partial derivative of w with respect to p; is 
ow i 
Ae 2P; Wi — 2Pi Wix + 2(1 — Pi)Wix — 2(1 — pw. 
Pi 


= 2(w; A w,). 


§.2.11 


The two quantities wj, and w,, are treated as constants in this differentiation. 
Substituting this into 5.2.7 gives Wright’s formula 


_ PC — pi) Ow 


AD; i 
Pi 2w Op; 


5.2.12 

In analogy with physical theory we can regard w as a potential surface, 
in which case 0w/dp; becomes the slope of the surface with respect to p;. 
Treating w;, and w,, as constants is equivalent to treating all allele frequencies 
except p; as constant fractions of | — p; (see 5.2.9). Thus dw/dp; is the slope 
of w in the direction where the relative frequencies of the other alleles do not 
change. The gene frequency moves over the surface at a rate proportional to 
the slope, but governed by the term p;(1 — p;). For extensions to populations 
not in random proportions, see Wright (1942, 1949). 

In more complicated situations there may not be a fitness surface, W, 
that is a function of the gene frequencies. For example, the fitnesses of the 
individual genotypes may not be constants, or there may be complications 
from linkage and epistasis. In such cases the fitness may not even increase 
at all. For a discussion, see Wright (1942, 1955, 1967) and Moran (1964). 
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When there are only two alleles, we can use equation 5.2.7 (for example) 
with i2 1 and x 22. We shall now write explicit formulae for the 
2-allele situation with random mating in several special cases. The fitnesses 
and frequencies of the three genotypes are given below: 


GENOTYPE A14; A,A2 A2A2 
FITNESS Wit Wia W22 
FREQUENCY Pi 2pip2 pi 


Then, from 5.2.9 and 5.2.7, 


w = Wi1Di + Who P2> 


WP = W12 Pi + W22 p», 


2 PiPlL(wii — wi2)pi + (War — W22)P2) 





Ap, 5.2.13 
Ww 
a. No dominance, w,; = (w11 + w52)/2. 
Wi; —W23) S 
Ap, _ PiPX 1 h PiP2 — 
2w 2w 
where s = wii — W22 
b. Dominant favored, w,, = (1,5, 4, dominant. 
2 
s 
Ap, = SP1P2 5.2.15 
w 
c. Recessive favored, wi = W22, A, recessive. 
2 
S 
Ap, = Peps: 5.2.16 
d. Asexual, haploid, or gametic selection (5.2.2). 
Spipz 
Ap, = ——. 5.2.17 
w 


These four formulae illustrate several important facts about selection. 
One is that asexual selection is more effective than sexual when the whole 
range of gene frequencies is considered. A diploid sexual population with no 
dominance evolves half as fast as an asexual population in which the two 
types differ by the same amount as the two homozygotes in the sexual popu- 
lation. Haploid or gametic selection is equivalent to asexual for a single 
locus. 

Comparison of b and c shows that selection is most effective when both 
the dominant and recessive genes are of intermediate frequency. In fact, the 
maximum rate of change is when the recessive allele is twice as frequent as 
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the dominant. Selection becomes decreasingly effective as the recessive gene 
becomes rare and the p? term gets closer to 0. Notice, by comparison with 
Table 4.1.1, that the situation where selection is inefficient is that where the 
heritability is low, as expected. 

A way of expressing partial dominance in a convenient manner is to use 
the terminology of Wright (1931 and later). We assign fitnesses and symbols 
to the genotypes as follows: 


GENOTYPE AA Aa aa 
FREQUENCY p? 2pq q? 
FITNESS (RELATIVE) 1 1 — hs 1—s 


In these terms, 


lios —spa[a + h(p — a)] 
1 — sq(2hp + q) 


The quantity, s, often is referred to as the selection coefficient. The quantity h 
is a measure of dominance. When the a allele is recessive, // = 0. When a is 
dominant, h = 1. When A is negative there is an overdominance. 

This formula brings out the importance of partial dominance when there 
is selection on a rare gene. If a is completely recessive and q is small the rate 
of change is proportional to g?. When q is small and h larger than q, the 
corresponding term is /ig (since p is very nearly unity). Thus, even a small 
amount of partial dominance (say, h = .05 or less) may make a great deal 
of difference in the nature of selection involving rare genes. This becomes 
especially important in the consideration of equilibrium gene frequencies 
when selection against a recessive genotype is balanced by new mutations; 
this will be discussed in the next chapter. 

As mentioned earlier, the number of interesting special cases is almost 
endless. Many of them were worked out by Haldane and many more have 
been done since. We shall at this point mention only two. 

It is probable that in nature much of selection is based on fertility. It is 
likely to be the rule, rather than the exception, that a gene has different 
fertility effects in the two sexes. Such differences are also frequent in mortality 
rates, as well. So, if we are to have a model that is at all realistic, it should 
take into account the possibility of different selection coefficients in the 
two sexes. 

Fortunately, there is an easy solution; the considerations of equation 
1.1.4 are applicable, and the combined fitness is the average of the two sexes. 
Furthermore, it is the simple average, since each individual is derived from 
one sperm and one egg. Therefore, except for X- and Y-linked genes, the 
contribution of the two sexes is the same, and the overall fitness of a geno- 
type is the unweighted average of that in the two sexes. This is not exact, 


5.2.18 
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however, for the resulting differences in allele frequencies in the two sexes 
at the time of mating may lead to departures from Hardy-Weinberg propor- 
tions (see 2.5.1); but the formulae are usually satisfactory as an approximation. 

X- and Y-linked genes cause no particular difficulty. Genes that are on 
the Y chromosome are found only in the heterogametic sex. The situation is 
entirely equivalent to asexual or gametic selection where only males (or 
females if they are XY in this species) are counted. 

For an X-linked trait we note that the gene in a male is derived from 
his mother whereas those in a female are derived equally from the two 
parents. Therefore the frequency in males is given by the formula applied to 
the fitness and frequency in females of the previous generation. The gene 
frequency in females is given by the average for the two sexes in the previous 
generation. Specifically, if q, and q; stand for the frequencies of the gene of 
interest in males and females, and we use primes to designate the next 
generation, 


w,— W 
P Ei s) 


Gm = d; 5.2.19 
m Wy 
and 
1 Wr—W m Wm — W 
a; = 5 [as a, + OL poo ccm ail 5.2.20 
Wy Wm 


The symbols w,, and w, stand for the average fitness of the allele in males 
and females, respectively. As mentioned above, these formulae are not exact 
because of departures from Hardy-Weinberg proportions; this may become 
important if selection is very intense. 

Throughout this section we have spoken of gene frequency changes 
rather than changes of genotype frequencies. Formulae can be written for 
changes in the genotypes directly, but they are much more cumbersome. We 
effect a great simplification by working directly with the gene frequencies. 
Furthermore, the zygotic types are put together and taken apart every genera- 
tion by the Mendelian processes of segregation and recombination, so that a 
zygote type (when many loci are considered) may never again be reconsti- 
tuted. For these reasons, almost all of selection theory deals with changes in 
gene frequencies. 

This has a price, however. There is usually some inaccuracy in going 
from gene frequencies to zygotic frequencies. We need to know something 
about the mating system. Even if mating is completely at random, there will 
be departures from Hardy-Weinberg ratios in all stages after mortality begins. 

We therefore regard the procedures as useful approximations rather than 
exact formulae. We follow the changes in gene frequencies; then we get the 
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genotype frequencies by the Hardy-Weinberg principle, or some modification 
thereof to include nonrandom-mating effects. If possible, we count the popu- 
lation at the zygote stage. Fortunately, much selection of evolutionary interest 
is relatively slow, and the Hardy-Weinberg ratios are very good approxi- 
mations even for adult populations. 

If we try to take into account the various complexities of populations 
in the real world the formulae naturally become more complicated. One 
obvious extension of what we have been doing is to consider survival and 
fertility as separate aspects of fitness. We illustrate with a special case. 

Consider a locus with two alleles, A, and A,, and with viabilities and 
fertilities as given below. 


GENOTYPE AA, A,Az Aii 
VIABILITY Vin Vi2 022 
FERTILITY Sa fiz fai 


The total fitness of a genotype will be the product of its viability and fertility. 

The part of the life cycle in which survival is important (from the stand- 
point of natural selection) is that prior to reproduction, so we let vj; be the 
survival to the time of reproduction. We are still assuming that generations 
are discrete and that matings take place at random among the adults. 

If the proportions of the three genotypes are P,,, 2P,;, and P32, and 
the enumeration is made at the zygote stage, then the combined survival and 
fertility (or expected number of progeny, crediting half to each parent) of 
A, A; is v;; fij. Letting vij fi; = wij, the equations of the earlier parts of this 
section (e.g., 5.2.6) are applicable. Equation 5.2.6 gives the changes in gene 
frequency and the proportions of the three zygotic types are p?, 2p, p, , and 
pi, where p, and p; are the new gene frequencies. 

If, on the other hand, the population is enumerated at the adult stage 
the situation is more complicated. For one thing, the genotypes at this stage 
are no longer in Hardy-Weinberg ratios. Suppose the population is censused 
just before reproduction. Then v;, is the viability up to this stage and /;; is 
the fertility. (Deaths that occur during the reproductive period can be 
accommodated by regarding them as reducing f.) Let P,,, 2P,,, and P32 
stand for the proportions of the three genotypes at the stage of enumeration. 

We can obtain the proportion of zygotes next generation as follows. 
The A, genes contributed will be proportional to P, f,; + P1; fi; and the A; 
genes proportional to P,5f,;-- P;;f;;. With random mating the three 
zygotic types will be in the ratio 


(Py, fiai + Piza)? XP + Piifi(Piifiai + P22 fa): 
(Pro fiz + Paaha). 
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The adult frequencies next generation, indicated by primes, are 


(Piafi + Piafi tr 








Pit = K > 
2P; = X(P fa + Piz fi Piz fi + Pra faa Wiz A 
K 
P!  (Pizfi2+ P32 J22) 022 
22 — K > 


where K is the sum of the numerators and is introduced to make the fre- 
quencies total 1. 

Successive application of these formulae gives the frequencies in later 
generations, but the equations are no longer simple functions of the allele 
frequencies. In most such cases and in more complicated ones the only way 
to get the results is to grind them out generation by generation. Of course 
high speed computers are a godsend for numerical results. 

In some cases a simplifying transformation can be found. One such 
appears when one class is lethal or sterile (Teissier, 1944; Crow and Chung, 
1967; Anderson, 1969). Suppose the A, A, class dies before the age of enumer- 
ation. Then, since we are interested in relative rather than absolute frequencies, 
let us arbitrarily choose 1.0 as the viability and fertility of one class. Accord- 
ingly, let 








vi = 1, U27—tU, t22 = 0, 
fi” l, fia f. 
Then 

P' (Pu + P4) 
11 7 K , 

P' (Pu + Piz f)P 42 fv 5.2.22 
12 — K , 

P}, =0. 


Note that the frequency, say q, of allele A, at the stage of enumeration is 
q = P, 
since 2P, 2 is the proportion of heterozygotes, and P4, + 2P,, = 1. 
We can simplify things by letting 
Py,  1-—2q 


yes Ta ie 5.2.23 
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Then the value of y next generation is 


ees = fut Tal (7) 45 5.2.24 
2P. 2P,;fv fj] 2v' = 


and we can write the simple recurrence relation 





Jii = ay, +b, 5.2.25 
where 
a= l/vf and b = 1/2v. 


This gives the ratio of normal homozygotes to lethal heterozygotes at 
the adult stage. If we want the lethal-allele frequency, it is given each genera- 
tion by 
7 l 

XE y.) 

An approximate expression for y, for any t can be obtained by writing 
an expression for the rate of change of y and treating this as a differential 
equation. 


q: 5.2.26 


l—vf 1 dy 
a ET tu AR 


This integrates into 


y, 7 Qo + C)e*' — C, 5.2.27 
where 
gato f 





vf ’ CS a 


Figure 5.2.1 shows some data from Drosophila population cages. The 
flies were of three genotypes, +/+ (normal), +/Sb (Stubble bristles), and 
Sb/Sb which is lethal in the larval stages and is not observed in the adult 
population. The data points are from weekly censuses and record the pro- 
portion of the Sb gene in the adult population. The data are an average of 
four populations, from each of which a sample of 200 adults was classified 
each week. The populations numbered several hundred. 

The average generation length under these circumstances is estimated to 
be about 2.5 weeks. The dotted line is the expected proportion of Sb chromo- 
somes if the gene is completely recessive and treating the situation as if the 
discrete model were appropriate (equation 5.1.1). The starting frequency, qo, 
is taken as 0.3. 
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Time in Generations 


Figure 5.2.1. Selection against a recessive lethal gene that produces 
Stubble bristles when heterozygous. Abscissa: time in generations. 
Ordinate: Sb gene frequency at the adult stage. The dotted line shows 
expectation for fully recessive lethal; solid line, 12% disadvantage of 
heterozygote. (Data from W. Y. Chung.) 


The solid line is obtained from 5.2.25 and 5.2.26, using v = .970, f = .907, 
fv = .880, and yo 2.667 (corresponding to qo = .30). As can be seen, al- 
though the population is changing continuously and the generations overlap, 
the data fit the expectation very well. 

The approximation 5.2.27, gotten by treating the process as if continuous, 
gives results that are almost indistinguishable from 5.2.22. Furthermore, it is 
not very important in this example to separate viability from fertility. For 
example, if we let v = 0.88 and f= 1.00, starting with a frequency of 0.30 
we have after 10 generations a frequency of 0.035; with v = 1.00 and f = 0.88 
we have 0.039, not very different. The product of v and fis more important 
than either of the components. 

For a detailed discussion of the practical procedures for measuring v 
and fin actual populations, see Anderson (1969). 

We can summarize this discussion of the time of enumeration by re- 
writing the formulae, and at the same time extending them to include multiple 
alleles. We are still assuming random mating. 

Enumeration at the zygote stage: 


(Y; Pij vfu) Pij vifi 


Pi; = C 5.2.28 
Enumeration at the adult stage: 

7 (Y; Pi Jiu Pi fi "ij 
Pi; = Me UL Me UNUM 5.2.29 


K 
C and K are chosen to make the frequencies total to 1. 
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Note that, for zygote enumeration, v; fi; = w;; and from 5.2.5 


DiWi — x Pivijfij- 5.2.30 
From the definition of gene frequency (2.1.2), 
, ; ; DiWi DjWj 
J 
= 5.2.31 
= PiWiW 
C » 


since Y'p,w, = W. 


If we sum both sides of 5.2.31, and note that Y' p; = 1 and Y p; w; = W, 
we see that C = w?, so 
, PiWi 
pi = ——,; 5.2.32 
w 
corresponding to 5.2.3. 
For adult enumeration, noting that 





pif; = » Püfij 
dis Zi php Pu 5.2.33 
= Dif ii 
K^ 
where 
jj Y P;f; dij. 5.2.34 
j 


Summing, as before, we discover that K = w, where 


w=) pifivi. 5.2.35 
Putting all this together, we have: 


ZYGOTE ENUMERATION ADULT ENUMERATION 
: (w,—w (0, f, — w) 
Ap. EN d E ) Ap, LP di 
Ww 
wi = È pyvufu Pfi = 2 Pufu, u= 2 pii vij 5.2.36 
J 
w => > PiP) Duty w= 2 2 PPSS Vij 
TT 
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In these formulae, w; is the weighted average of the product of v;; and 
fij, while vj f; is the product of the weighted averages of v;; and f;;. The gene 
frequencies refer, of course, to the frequencies at the time of enumeration. 

As we saw in the numerical example, which involved rather strong 
selection (a lethal homozygote and a heterozygote with about 127, dis- 
advantage), the results were approximated rather well by a model assuming 
that all the selection is through fertility differences and also by one assuming 
that all the selection is through viability differences of the heterozygote. 
Furthermore, a continuous approximation arrived at by treating the change 
by a differential equation also gave a good approximation to the results. 
This confirms our intuitive judgment that, unless selection is quite intense, 
there is not very much difference between the various models. 

We shall sometimes use a discontinuous model and sometimes a contin- 
uous one, making the choice onthe basis of whichseems more natural or which 
is more manageable. We turn now to a discussion of the continuous model. 


5.3 Continuous Model with Overlapping Generations 


Many populations in nature have births and deaths occurring more or less 
continuously, with both reproduction and mortality at various ages. Under 
these circumstances the continuous models (models 2 and 4 of Chapter 1) are 
more appropriate. We shall develop formulae analogous to those of the 
preceding section. 

Fitness is measured in terms of the Malthusian parameter, m. This is 
the rate of geometric increase such that the contribution of a class to the next 
generaiion is proportional to e". We expect equations of the general type 
discussed in Section 1.6. 

Again, as in the last section, we concentrate on gene frequency changes. 
Because of mortality selection the Hardy-Weinberg (or other specified) ratios 
at birth will be changed as each cohort gets older. So there will ordinarily be 
no stage in which the entire population is in these ratios. One procedure that 
would at least partially mitigate this difficulty is to enumerate the population 
at birth, then get the proportion at different ages from life-table information; 
in other words, use genetic information only to predict the number of each 
genotype born at a particular instant. 

However. we are mainly concerned with approximate results. So we 
assume that the conditions of model 2 are reasonably well met and that we 
are interested primarily in gene frequency changes. 

As in Section 1.2 we let b and d stand for birth and death rates. For 
example, the genotype 4; A; would have a probability b;j At of giving birth 
and d;; At of dying during the infinitesimal time interval At. We let mj; = 
b,j — d 


ij 
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Let 2N stand for the total number of genes at the A locus in the diploid 
population and 2n; for the number of A; alleles. Then p; = nj/ N. During the 
time interval At the increase in population number due to the contribution 
from 4, A; parents is NPi; mi; At, which is also a measure of increase in A; 
genes due to A; A; parents, since each parent contributes one A; gene to each 
progeny. Likewise for any genotype 4;4;, the increase in A; genes due to 
contributions from this genotype is NP;;mi; At (only half the total frequency 
of A,A; is used since only half the contributed genes are A;). Thus, when 
At becomes small, 


d 

ED = 3 NP;; m;; = m;ni, 5.3.1 

m; = 2, Numi _ ALIE = 2; Pu mi 5.3.2 
nj » 


where m; is the average fitness of gene A; measured in Malthusian parameters. 
Likewise, 


— =) NP; m; = MN, 5.3.3 


m= È} P; -» pmi, 5.3.4 


where m is the average fitness of the population, again measured in Mal- 
thusian parameters. A justification that the arithmetic mean of the individual 
m's is appropriate was given in equation 1.2.4. 

From the ordinary rules of differentiation, 


Zt v dni ee 

dp | \N/ dt ' dt 

dt dt — N? 
Nn,m, — n, Nim os 
UN 
= pm, — m), 


where m; — m is the average excess in Malthusian parameters of the allele 4;. 

The similarity of 5.3.5 and 5.2.6 is apparent. The continuous and dis- 
continuous models become more nearly equivalent as the selective differences 
among the genotypes become smaller. If the w's are regarded as relative 
fitnesses and one genotype is assigned the value |, then w is very nearly |. 
Since m,; = log, w;; (cf. Section 1.2), 


W; — W zz e" — e^ — m, — m, 
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and thetwo equations 


dpi — p(m, — m) and Ap, piii w) 
dt Ww 
become nearly equivalent. 

We shall find that for some purposes one formula is more suitable than 
the other. Usually the qualitative results are very much the same so we shall 
often choose whichever model leads to the simplest results. 

For two alleles and random mating, 5.3.5 can be written (in analogy 
with 5.2.7) approximately for slow selection as 


dpi 


p = p,pY(m, — m;). 5.3.6 


This is only approximate because the adults may depart from Hardy-Wein- 
berg ratios. 

Now, consider the same special cases as before. With two alleles and 
random mating we can write the approximate formulae 


_ My Pt + M42 PyP2 _ 
Lo ae A +™2P2; 
1 


Mı = Mı2 Pı + M22 P2- 
Substituting these into 5.3.6 gives 


dp, 


"dt = Py Pl(Mi i — m,3)p, + (m5, — m;3)pi]. 5.3.7 


a. No dominance, m, 2 = (m,; + m;;)/2. 


Pi 
— = sp,p,/2, 5.3.8 
dt 1P2/ 
where s = mji — Ma2. 
b. Dominant favored, m,, = m,,, A, dominant. 
dp, 
—S 5.3.9 
"di pipi. 
c. Recessive favored, m,; =m 2, A, recessive. 
en = spip 5.3.10 
di 1P2. 3. 


d. Asexual, haploid, or gametic selection. 


dp 
s = Spip - 5.3.11 
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The same general observations can be made as were made for the 
discontinuous model. The relative rates of gene frequency change are as 
before. 

Notice that 5.3.11 is the equation of the logistic curve. This is apparent if 
we note the correspondence of p, with N/K and p, with 1 — N/K in equation 
1.6.2. In integrated form, 5.3.11 becomes 


1 


1 XN > 
1+ | Poles 
Po 


where p, is the frequency of the favored gene at time t and pj is the initial 
frequency. 

We can proceed as we did with equation 5.3.11 and write all the equations 
in integrated form. For our purposes it is more convenient to writetheminthe 
form t — f(p), than with p as a function of time. Integrating and letting po 
stand for the initial proportion (when t = 0) and p, = p, equations a, b, c, 
and d become: 


P: = 5.3.12 





a’. No dominance. 
2, Pl — po) 


t=-In 


< 5.3.13 
S  po(1l— p) 


b'. Dominant favored. 
1 1- 1 1 
E [m2 p), E |. 
s 


Po(1 — p) 1—p, 1-—po 


t= 


5.3.14 


Recessive favored. 


fin B= ood 1,1] 


Qo 


t=- —-— 


s po(1 s Pe) Di Po 
d.’ Asexual or haploid. 


6.3.15 


1, Pl — Po) 


tpm 


: 5.3.16 
S po(1 — p) 


In each case p designates the frequency of the favored gene, and t is the 
number of generations required to change the frequency from p, to p,. The 
value of s is assumed to be small enough that the departure from Hardy- 
Weinberg proportions does not introduce serious errors. 

The reason for writing the equations in this form is apparent. [t empha- 
sizes the fact that t is always inversely proportional to s. This is true, or 
approximately true, as long as s is not large enough to appreciably upset the 
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Hardy-Weinberg proportions. It is also true for slow selection with a 
discrete model. This can be seen from equations 5.2.14—5.2.17 by noting that 
when s is small Ap has about the same meaning as dp/dt, and w is approxi- 
mately 1. 

Table 5.3.1 shows the number of generations required to change the 


Table 5.3.1. The rate at which gene and genotype frequencies change under 
selection. 


NUMBER OF GENERATIONS REQUIRED WHEN S — .001 
TO CHANGE GENE FREQUENCY 


from .00001 from .01  from.5 from .99 


‘to .0l to .5 to .99 to .99999 
Asexual 6,921 4,592 4,592 6,921 
No dominance 13,842 9,184 9,184 13,842 
Dominant favored 12,563 5,595 102,595 99,896,918 
Recessive favored 99,896,918 102,595 5,595 12,563 


NUMBER OF GENERATIONS REQUIRED TO CHANGE THE PROPORTION 
OF DOMINANT (OR RECESSIVE) PHENOTYPES 


Dominant favored . 6,920 4,819 11,664 309,780 


Recessive favored 309,780 11,664 4,819 6,920 
Asexual 6,921 4,592 4,592 6,921 


gene or genotype frequency when s = 0.001. For any other value, say s’, 
simply divide the numbers in the table by s'/.001. For example, with s’ = 0.01, 
the times would be only 1/10 as large. 

The values in this table are taken mostly from Haldane. For more ex- 
tensive results and a variety of other cases, see his The Causes of Evolution 
(1932, 1966). 

These equations are written as if s remains constant throughout the entire 
period of gene frequency change. That this should be strictly true is of course 
highly unlikely in any real case. But it gives us a general idea of the times 
involved in evolutionary change and the qualitative effects of dominance and 
recessivity. Furthermore, as emphasized in Section 1.6, equations of this 
type apply to changes in components of a population even though the whole 
population may be increasing, decreasing, or constant and under a variety 


SELECTION 195 


of regulatory mechanisms. Again we see why it is usually convenient in 
population genetics to discuss proportions rather than numbers of individuals 
or of genes. 

Obviously, some of the large values are unrealistic. 99,896,918 genera- 
tions is probably longer than the life of the species, and certainly s is not 
going to be constant for that length of time. Furthermore, when the fre- 
quency of the gene is very near to O or to 1, random fluctuations in gene 
frequency can carry the gene to loss or fixation. A treatment of this problem 
taking chance factors into account has been given by Ewens (1967d). As 
might be expected, the values in the table are quite good for moderately large 
populations in the range of gene frequencies from 0.01 to 0.99, but the 
numbers at the tails of the distribution are often in serious error even in quite 
large populations. In particular, the largest values in the table are much too 
large. 

Later, in Section 8.9, we shall consider the related problem of the length 
of time required for fixation of a mutant gene in a finite population. 


5.4 The Effects of Linkage and Epistasis 


When linkage and epistasis enter the problem, the situation immediately 
gets complicated. For one thing, whereas the Hardy-Weinberg ratios within 
each locus are attained within a single generation, gametic phase equilibrium 
is approached only asymptotically, as was discussed in Section 2.6. So we cannot 
be as free with the assumption of between-locus equilibrium as within a locus. 

We can circumvent this to some extent by using the more generalized 
form of the Hardy-Weinberg principle that we discussed m Section 2.6. 
This states that the array of zygotic frequencies can be written as the 
square of the array of gametic frequencies. So we can deal with the problem 
by treating the chromosome, or the entire gamete, as the unit instead of the 
gene. 

We will discuss the amount of linkage disequilibrium that is produced 
by selection with linkage and epistasis; in fact we shall find that there is 
“linkage ” disequilibrium, even when there is no linkage. We shall also discuss 
the effect of epistasis on the rate of change by selection. 

Many of the problems are still unsolved. It is not difficult to get reason- 
ably good answers when linkage is loose and epistasis is weak. We can also 
treat the situation with very tight linkage as if the linked genes were a single 
gene. But the intermediate area, a small amount of crossing over and strong 
epistasis, is very difficult and no general theory exists. Individual examples 
have sometimes been worked out by computer. With more than two loci, 
the situation naturally gets still more complex. 
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Generation of Gametic Phase Disequilibrium with Epistasis 


Assume that there are two loci, each with two alleles. There are therefore 
four gamete types. For the moment we shall treat the loci as if they were 
completely linked. We can then regard each chromosome as the formal 
equivalent of a gene and assign symbols in the same way. It is as if there 
were a single locus with four alleles. 


CHROMOSOME ab Ab aB AB 
FREQUENCY pi Da P3 Pa 
AVERAGE FITNESS m, ma ms Ma 


Fitness is measured in Malthusian parameters; therefore the model is a 
continuous one. As before, we let 5;; stand for the birth rate and d;; stand 
for the death rate of a particular genotype. The time, t, is measured in 
generations. The genotype Ab/ab would have a frequency 2p,p;. It would 
have a probability 5,; At of giving birth and d, ; At of dying during the small 
time interval At. Then the fitness of Ab/ab is m,; = 6,, — d,. 

In the absence of recombination we can use equation 5.3.5 of the 
previous section and write (assuming that there are Hardy-Weinberg 
proportions) 


dpi EM 
— = pim; — m), 5.4. 
di pi(m; — m) 1 
where 
m; = 2. p;mij 
J 
and 


m-»pmi-2ppm;. 
i i j 


With crossing over the double heterozygote will contribute some gametes 
that are different from those it received from its parents. We measure crossing 
over by c, the recombination fraction. Unlinked genes will be treated as the 
special case when c = 1/2. 

Consider the production of gametes of type ab. With homozygotes and 
single heterozygotes crossing over makes no difference, so their production 
of ab gametes is independent of c. There are 2p, p, Ab/aB double heterozy- 
gotes and a fraction c/2 of their gametes will be ab. There are 2p,p4 AB/ab 
double heterozygotes and a fraction (1 — c)/2 of their gametes will be ab. 
Putting all this together, the change in the number of ab chromosomes in the 
time interval At will be 


2N[pipibii + PiP2 512 + PiP3 b13 + Pi P45, 4(1 — c) + P2 P3b23C 


— pipidii — Pı P2412 — PiP3413 — PiP4dy4]At 5.4.2 
= XNp,m, — Nbc D)At, 
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where 

D = pipa — P2 P3 5.4.3 
and 

b = b, = baz, 


on the entirely reasonable assumption that the two kinds of double hetero- 
zygotes have the same birth rate. 

Meanwhile, the change in the number of all four kinds of gametes 
together in the same time interval is 2NMAt. We then use the same method 
used in deriving 5.3.5 to get the rate of change in the proportions of the 
chromosome types, writing dp,/dt = d(n,/N)/dt, etc. This leads to 


zn = pim, — m) — ebD, 
f = p2(m, — m) + cbD, 
" MM 5.4.4 
os = p4(m, — m) — cbD. 


These equations were first derived by Kimura (1956). 

There is another way of measuring departure from linkage equilibrium 
that is more useful than D for our purposes. Whereas D is the difference in 
frequency between the two types of heterozygotes, we define Z as the ratio 
of the two. Thus 


zo PPa RAS 
PiPa 
and the relation between D and Z is given by 
D= P2 pa(Z = 1). 5.4.6 


Z has the property, first shown by Kimura (1965b), of approaching a 
nearly constant value when mating is at random and recombination is large 
relative to the amount of epistasis. Such a slowly moving equilibrium is called 
quasi-equilibrium. The natural logarithm of Z is 


log, Z = In Z = Ín p, — In p; — In p; + In py. 


We are interested in determining what happens to the linkage dis- 
equilibrium as selection proceeds. To do this we inquire into the rate of 
change of log Z with time. 
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The time derivative of In Z is 


Zdt pdt pm dt py dtp, dt’ a 
Substituting from 5.4.4 and simplifying leads to 
lL aZ 1 1 l l 
zsm -mm tm bD + 4 a) 2s 
= E —cbDP, 
where 
E=m —m,—m,+m, 5.4.9 
and is a measure of epistasis, and 
P=}, Be 5.4.10 
i Pi 


If c is larger than |£| (more specifically, if cb is larger than |E|, but b 
is not far from | for a population of stable size), then the value of Z tends 
toward a value which is relatively stable. We start by writing 5.4.8 again and 
substituting for D from 5.4.6. This gives 

dz 


in EZ + cbZ(] — Z)p; p, P 5.4.11 


= EZ + cb(1 — Z) — cb(1 — Zi (pa + p3)- 


Since 0 < p, + p, € l, we can write 


Z 
p" < EZ + cb(1 — Z), 5.4.12 
dz j 
1 > EZ + ch(l — Z) — cb (1 — Z)* = EZ + cbZ(1 —Z). 5.4.12a 


Consider first a haploid model, in which case the m,’s, and therefore E, 
are constants. We then integrate these two inequalities to obtain 








cb cb—E 
Z< — (cb - E)r = =n A4. 
Sp zl! + Ke T. Ky Za d ) 1, 5.4.13 
E + cb l |! /E+cb 
y/o a O = = | — |- 1. .4.13a 
A cb h Diana K, zi cb | 1 5.4.13 


Assume first that cb > |E|. Then as t gets large the value of Z is given by 


cb - E cb 
<Z< : 
cb cb — E 








5.4.14 
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For example, if c = 10E and b = 1, Z comes to lie between 11/10 and 10/9. 
If E is negative, then Z becomes less than 1. For example, with c= —10£, 
Z comes to lie between 9/10 and 10/11. 

This is equivalent to Felsenstein's (1965) statement that D becomes of 
the same sign as E. We can say that log Z becomes the same sign as E. Of 
course, if the selection is intense and c and E are small, the gene frequency 
change may be so rapid that the whole process is over (or nearly over) before 
the conditions of 5.4.14 are reached. 

If one chromosome type (or pair of genes, if these are on independent 
chromosomes) is ultimately to be fixed, we can ask about the asymptotic 
behavior of Z. Assume, to be specific, that E is positive and that the ab or 
AB chromosome is favored by selection and will eventually prevail. As this 
stage is neared, p; + p, approaches 0, and 5.4.12 and 5.4.13 become equalities 
rather than inequalities. Thus, the limiting value of Z is given by 


cb 


c — cb—E 5.4.15 
If Ab or aB is eventually fixed, then 5.4.12a and 5.4.13a are appropriate and 


| FE 
9? cb `’ 





5.4.15a 


This is illustrated in Figure 5.4.1. Three starting populations are assumed, 
one at linkage equilibrium (Z = 1), one with a large positive departure, and 
one with a large negative departure. E is taken to be 0.1 and c = 0.5. There- 
fore from 5.4.14 we would expect the value of Z to come eventually to lie 
between 1.20 and 1.25. Since the AB type is favored, it will ultimately be 
fixed in each of these examples, and the final value of Z will be 1.25, as 
indicated by the arrow in the figure. 

Figure 5.4.2 shows an example that is the same in every regard except 
that the genes are linked with c — 0.2. In this case, the quasi-equilibrium will 
be between 1.5 and 2.0 with an eventual value of 2. The approach is slower 
because of the lesser recombination. Very roughly the rate of approach to 
quasi-equilibrium is the recombination rate, c. 

Alternatively, if cb « |E| there is not necessarily any limit on Z. Under 
these circumstances, when t gets large 


cb+ E 
cb 





= Z; Ejcb > 1, 5.4.16 


cb 
0<Z< 


<Z2< bE’ Ejcb < —1, 5.4.16a 


with no upper limit in the first case and no positive lower limit in the 
second. 
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Figure 5.4.1. The attainment of quasi-linkage equilibrium with 
epistasis. This is a haploid model in which the fitnesses, measured 

in Malthusian parameters, are as given in the graph. Three starting 
values of Z are assumed. Note that value of Z comes to lie within 
the limits within about half a dozen generations and then very slowly 
changes to the upper limit. The recombination value is taken as 0.5. 


| ə al 


b —0.02 
8|-0.02 0.06 





Figure 5.4.2. The same model except that the recombination 
value, c, is 0.2 instead of 0.5. 
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Further Analysis of the Diploid Case We can be specific only by con- 
sidering particular epistatic models. As was pointed out in Section 3.6, four 
parameters are required to specify the possible kinds of 2-locus epistasis 
(two alleles at each locus). There we used J, J, K, and F; these parameters 
were chosen to bring out some of the effects of epistasis with inbreeding. 
For measuring the effects of directional selection another set of four param- 
eters is more useful; at least it leads to a more transparent interpretation. 
The parameters were introduced by Fisher (1918). Our treatment follows 
Felsenstein (1965) whose paper may be consulted for a more detailed dis- 
cussion. (Note: Felsenstein's Z is our log, Z.) 

The four parameters are 


E, =m, — Ma; — m3, + M415 


E, = m, — M22 — M32 + M42, 


5.4.17 
E; = m, — M23 — M33 + mas, 
E4 = M4 — M24 — M34 + M44. 
But, from the definition of m; for diploids (5.4.1), 
M, = p,m,, + p;m,5 + P3 M3 + p,m,,, 
m = PıM2, + P2M22 + P3 M23 + P4 M24, Kass 
m; = pma, + P2M32 + P3M33 + P4 M34, 
ma = pma, + P2M42 + pyma43 + P4 M44. 
Substituting these into 5.4.8 yields 
1 dZ 
Zar pie, + p; E2 + P3 E3 + P4 E4 — cbDP ee 
= E — cbDP, 
where 
E = p E, + pa E; + p3 E + p4 Ea. 5.4.20 


We cannot proceed as we did with equations 5.4.13 and 5.4.13a because 
E is a function of the p;s rather than a constant. However, we can obtain 
useful but less precise limits, when all the £,’s are less than cb in absolute 
value, by writing in analogy with 5.4.14 

cb + Enin cb 


<Z< 


———., |E,| < eb, 5.4.21 
cb cb — E max 


where Emin and Emax are the smallest and largest values of the E''s. 
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Thus, as in the haploid model, if c is large relative to the absolute values 
of the E;'s, the value of Z eventually comes to lie within specified limits and 
the conditions for quasi-equilibrium are met. If the selective differences are 
small and c is large, the quasi-equilibrium is attained rapidly and the popu- 
lation is in this state when the gene frequencies are changing. 

Table 5.4.1 shows the nature of the four epistatic parameters. The fit- 


Table 5.4.1. The fitnesses in Malthusian parameters for the ten possible genotypes 
when there are two loci with two alleles each. Below are the parts of the figure that 
contribute to the four epistatic parameters. For example, the first small table is 
composed of the four upper left squares in the large table. E, is positive when the 
average of the upper left and lower right square is greater than that of the other two. 


aa Aa AA 
bb mss mia maa 
Bb mis Mis = M23 maa 
BB mas maa maa 
mij M12 m;2 M22 mis M23 mia. maa 
mias mia m23 Mra Mas Maa ma. Maa 
Ei E E; Ea 


nesses of the genotypes are given in Malthusian parameters. Then, below, the 
table is decomposed into four parts, each comprising a quadrant of adjacent 
squares. If epistasis is "regular," the signs of the four epistatic parameters 
will tend to be the same. When E; has the same sign in all four of the sub- 
tables, then the direction of departure from gametic phase balance is clear. 
When it is inconsistent and changes signs, then there are opposite tendencies 
and it is not obvious which will predominate. It is necessary to work out each 
case separately. 

Figure 5.4.3 shows the change in Z in a diploid model. The time period 
shown here is much longer than in the previous figures, and it can be seen 
that Z slowly changes during the period of selection. However, it remains 
between the limits set by 5.4.21, 1.00 and 1.29 during the entire time. 

It is useful to go back to Table 3.7.1 at this point and see what could be 
predicted about selection in these systems. Notice that in models, 2, 6, and 8 
the E;'s are all positive; therefore, selection in these cases will develop positive 
linkage disequilibrium (Z > 1). Model 5 is the opposite; all E;'s are negative. 
Therefore it would build up linkage disequilibrium in the opposite direction. 
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Figure 5.4.3. Another illustration of the fast approach to quasi- 
linkage equilibrium and the slow change in Z thereafter when 
epistasis is small and linkage is loose. This is a diploid model with 
recombination of 0.2 between the loci. The fitness and epistatic 
parameters are given in the figure. 


Models 7 and 9 are mixed and the direction of departure depends on the 
gene frequencies. 
Table 5.4.2 shows the change in Z when E is large and linkage is tight. 


Table 5.4.2. Changes in chromosome frequencies and gametic phase unbalance 
(Z) in a diploid model with close linkage, c = .01. Fitnesses are: woe, = 1.10, 
WA -bb = WaaB- — .95, Wa-g- = 1.00. 


CHROMOSOME FREQUENCIES 


GENERATION s Z 
AB Ab =aB ab 
0 .250 .250 .250 1.00 
10 278 218 .268 1.67 
20 .294 .185 .336 2.87 
40 .263 .114 .508 10.02 
80 .025 .007 .961 483. 


Notice that in this case there is no quasi-equilibrium and Z continues to 
change. 
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The Effect of Epistasis on the Rate of Natural Selection Unless 
linkage is close or epistasis strong (unless E/c is appreciable), the considera- 
tions that we have just discussed lead us to expect that epistasis would have 
rather little effect on selection. Although it is hard to make such a statement 
quantitative, we can give some indication of the direction of the effect. We 
shall return to the question of the magnitude of the effect in Section 5.7. 

To be concrete, assume that the large letter genes generally increase fitness. 
Thus in Table 5.4.1, fitness tends to increase as we move from upper left to 
lower right. We have seen that if the E;'s are positive then Z tends to be 
greater than 1, that is, an excess of ab and AB chromosomes tends to develop. 
Those chromosomes that have the lowest fitness and those that have the 
highest, on the average, are the ones that are increased by epistasis. Thus the 
effect of positive epistasis is to make the population more variable. Since 
the rate of change by selection increases with the variability of the population, 
the effect of positive epistasis is to increase the speed of selection. 

Conversely, if epistasis is negative (the £,’s are negative), the mediocre 
chromosomes accumulate in excess. The result is a less variable population 
and slower selection. 


Linkage and the Establishment of Beneficial Mutant Combina- 
tions It is sometimes true that two or more genes that are individually 
deleterious interact to produce a beneficial effect. If these genes are newly 
arising mutants, and therefore rare, the population is in a troublesome 
situation. The essential situation is clear with a haploid model, so we shall 
consider this. Let the original population be ab. Suppose the fitnesses, w,,, 
Wab> Wag, and wag are in the ratio 1:(1 — 5): (1 — s5) : 1 + t, where the 
s’s and t are positive. If ab is the prevailing type, Ab and aB will be present 
in small numbers determined by their mutation rates and the magnitude of 
the s’s. Very infrequently the AB type will arise, either by mutation or by 
recombination between Ab and aB. This individual will ordinarily mate with 
the most common type, ab. With free recombination less than half of the 
progeny will be AB, and unless this type possesses an enormous selective 
advantage it will not increase in frequency. Only if the fitness of the AB 
type is great enough to compensate for the loss of AB types through recom- 
bination will the genotype increase. However, if for any reason AB becomes 
common so that many of the matings are with individuals like themselves 
or with Ab and aB, then the genotype can increase. This type of situation 
has been extensively discussed by Haldane (1931), Wright (1959), and Bod mer 
and Parsons (1962). 

One way in which the rare mutant combination might increase is when 
there is strong assortative mating. However, it is quite unlikely that the 
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mutant combination that was favorable for some reason would also happen 
to predispose the individuals to mate assortatively. 

The genes might also happen to be linked. If the rare AB individual 
mates with an ab type, which will usually be the case, the proportion of AB 
progeny will be proportional to ] — c, wherec, as before, is the recombination 
fraction between the two loci. However, the 4B type will increase from these 
matings if the extra fitness of the AB type is enough to compensate for this; 
that is, if (1 + (l — c) > 1, or t > c/(1 — c). 

The conditions are actually a little less stringent, because some of the 
matings will be with AB, aB, or Ab types. Furthermore there is a small 
addition by recombination from Ab x aB matings. These do not change the 
direction of the inequality, however, so we can say that linkage of such 
intensity as to give a recombination of t or less is sufficient to insure the 
incorporation of the double mutant. Actually the same algebra works for 
diploids, where t is the advantage of the double heterozygote over the pre- 
vailing type (Bodmer and Parsons, 1962). 

Whether such mutant combinations are important enough for this to be 
an important reason for having linkage is doubtful; but it does illustrate one 
situation where linkage introduces a qualitative change in the outcome, not 
just a change in rate. 


5.5 Fisher's Fundamental Theorem of Natural Selection: 
Single Locus with Random Mating 


Up to this point we have concentrated on the rates at which selection changes 
the genic and genotypic composition of a population. We are also interested 
in the way in which quantitative phenotypes are changed by selection. The 
selection may be natural or man-made. The character may be yield of a 
cereal, rate of gain in meat livestock, or whatever trait is of interest. In 
natural selection the trait of greatest significance is fitness—the capacity to 
survive and reproduce in the existing environment. 

It is clear that natural selection tends to preserve those genes which, on 
the average, increase the fitness of their carriers, and therefore to increase the 
fitness itself. It is also clear that this is happening to all competing species 
at the same time, so that with increasing time it requires greater and greater 
intrinsic fitness for a species to survive the steadily increasing competition. 
We are looking for an expression describing the rate at which fitness is 
increasing, while realizing that this is not likely to be reflected in increasing 
rate of population growth because of limitations of the environment and 
competing species. 

Intuition tells us that the rate at which selection changes the fitness will 
be related to the variability of the population. We shall show, as Fisher 
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(1930) first did, that the rate of increase in fitness is equal to the genic variance 
in fitness. If equations of the form of 5.3.5 are applicable, the theorem is 
precise. But we are primarily interested in it as an approximation applicable 
to a wide variety of population models. 

We start first with a simple case: a single locus, two alleles, no environ- 
mental effect, and random mating. 


GENOTYPE A14; A,A2 A242 
FREQUENCY p? 2p1P2 Di 

FITNESS my, =mMm+ ayy mi2=m+ ai2 m22 m+ a22 
GENIC VALUE m+2a, m ra, a2 m+ 2«s 


The average of the deviations from the mean must be 0; hence, as shown 
in 4.1.3, 


2pia, + 2p,p2(a, + a2) + 2p3.4, =0 
or, since p, + p; = 1, 
Pi% + P2% =0. 5.5.1 


The genic variance was shown earlier (4.1.5) to be 


V, = 2( pay + paa) 5.5.2 
with the obvious extension to multiple alleles, 
V, 22) pai. 5.5.3 


The quantity a; is called the average effect of the allele A;. The term 
was introduced by Fisher (1930). 


Alternatively, we can define a as the average effect of substituting A, 
for A,, in which case a = a, — a, and 
V, = 2p p22. 5.5.4 


However, we prefer 5.5.2 because it leads so naturally to an extension to 
multiple alleles. 
In the model we are considering, 


m = pim; + 2P1P2 M2 + pim. 
The rate of change in the average fitness will be 
din 
ae =2(pim,, + pama) P Pt + Apima + pama) 22 


d d 5.5.5 
P P 
= -2m,-— di : T 2m m, = , 
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where m, is the average fitness of the allele A, and m, is that of A, (5.3.7). 
But from 5.3.5, 


dp - dp Z 
p = p,(m, — m), a = p,(m, — m). 5.5.6 
Hence 
dm m d 
“de = 2pym,(m, — m) + 2p; m2(m2 — m) 
t 5.5.7 


= 2p,(m, — m)? + 2p,(m2 — my. 


The equivalence of the two expressions above may be shown by ex- 
panding the second to give (ignoring the factor 2) 


pi(m, — m)(m, — m) + p;(m; — m)(m; — m) 
= pım (m, — m) + pa mı(mı — m) — m(p,m, + pı m; — m). 
But, from 5.3.4, m = p,m, + p; m;; so the quantity in the last parentheses 
is 0. 
Now, note from 5.3.7 that m, = p,m,, + P2m,2, and therefore 
m, — M = Pym, + pom; m 
= p1d1: + P2412 5.5.8 
— a, 
from 4.1.9. Substituting into 5.5.7 gives 


a = 2p? + 2p,a3 = V, 5.5.9 
from 5.5.2. 

This example illustrates a special case of Fisher’s Fundamental Theorem 
of Natural Selection. In his words: ** The rate of increase in fitness of any 
organism at any time is equal to its genetic variance at that time.” Fisher’s 
genetic variance is what we are calling the genic or additive genetic variance. 

When the genotypes are in random proportions, as in the model we have 
been discussing, the genic variance in fitness can be written in another form 
that is sometimes useful. Combining 5.3.5, 5.5.5, and 5.5.7, we can write 


1 ny 1 (y 
V, = |=- (2 + D; dt |: 5.5.10 
or for multiple alleles, 


1 dp; i 
V2) 5 (2) : 5.5.11 
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The discrete generation analog of 5.5.9, related to 5.2.6 in the same 
way that 5.3.5 is related to the continuous formula, is 





V, 
Aw = £ | + PAP (wi, —2w,; + waa) : 5.5.12 
W 2w 


This can be derived easily as follows (Li, 1967a, b). We note first that 
with random mating 
W= wy Pi + 2wi5 pipa + W22 pi (p =1— p,), 
dw 
pg 2[p,(w14 — W12) + po(vio — w22)] 
pi 
= 2(a, =m a2) = 2x, 


by 5.5.8 when aj, = wi; — W, etc. 





d^w 
dp 2(w,, — 2w,5 + w22), 
1 
2 PiP2 dw 
PET oR dp, 


from 5.2.13. We now expand Aw = w' — w into a Taylor series, 


dw | (Ap) dw 


scan 49 | 
awe ap 2 dpi 





which is exact because w is a quadratic function of the p’s and all derivatives 
beyond the second are 0. Substituting, 





Aw = 2p, p28? 4 pipia? 


w 2w, +w 
w 9 ( 11 12 22) 





V, pip 
= | AE = (Wi, — 2w,5 + 22) 


as was to be shown. 

Unless selection is strong the quantity in brackets is very nearly |. This 
is especially true if several loci contribute to the trait and dominance is in 
different directions, for then the quantity in parentheses is sometimes positive 
and sometimes negative with considerable cancellation resulting. So, to a 
good approximation in many instances, 


Aw = 


gx 


! 5.5.13 
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or perhaps more meaningfully, 
Aw 


w w 


Lx 


N 
. 


That Aw is always positive may be shown by noting that 
Dipi(wii — 2w15 + W22) = Paar + PiW22 — Pii 
— 2p1P2 12 — pi Wi 
= Wr = W, 


where w; and w are the means of a completely inbred population and a 
randomly mating population. Substituting this into 5.5.12 gives 


Aw = Va (5). 5.5.14 
2w w 

Since all the quantities in this expression are positive, or 0, the fitness must 
always increase or be at an equilibrium. 

We shall not extend this to multiple alleles but the conclusion is still 
correct—the fitness with random mating and constant w;;'s can never de- 
crease. For references, see Mulholland and Smith (1959), Scheuer and Mandel 
(1959), and Kingman (1961, 1961a). 

The interpretation of Fisher's theorem has been a matter of consider- 
able discussion. Clearly, if the fitness of a genotype is defined as the intrinsic 
rate of increase, the average fitness cannot increase indefinitely, as the theorem 
would seem to say. The population growth rate may be 0, or its size may even 
decrease. One way in which this might happen is if the linkage relations 
change through recombination so that less favorable chromosomes increase 
in frequency despite natural selection. Another is if the fitnesses of individual 
genotypes change. For example, the fitness of a genotype may depend on its 
frequency or the frequency of other genotypes, or the environment may be 
deteriorating so that all genotypes become less fit. 

One interpretation of the theorem is to say that it measures the rate of 
increase of fitness that would occur if the gene frequency changes took place, 
but nothing else changed. The theorem thus gives the effect of gene frequency 
changes alone, isolated from the other things that are happening. Fitness 
becomes a rather abstract quantity that continually increases while the popu- 
lation size is roughly stabilized by the various factors that cause the environ- 
ment to change. This is what Fisher seems to be saying in at least one passage: 


For the majority of organisms, therefore, the physical environment may be 
regarded as constantly deteriorating, whether the climate, for example, is 
becoming warmer or cooler, moister or drier, and this will tend, in the majority 
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of species, constantly to lower the average value of m, the Malthusian parameter 
of the population increase. Probably more important than the changes in 
climate will be the evolutionary changes in progress in associated organisms. 
As each organism increases in fitness, so will its enemies and competitors 
increase in fitness; and this will have the same effect, perhaps in a much more 
important degree, in impairing the environment, from the point of view of each 
organism concerned. Against the action of Natural Selection in constantly 
increasing the fitness of every organism, at a rate equal to the genetic variance 
in fitness which that population maintains, is to be set off the very considerable 
item of the deterioration of its inorganic and organic environment. 


Alternatively, we can interpret fitness more concretely as the actual rates 
of change and introduce corrective terms to include the effects of linkage 
and epistasis, nonrandom mating, changes in the fitnesses of individual 
genotypes, and the effects of overcrowding and deterioration of the environ- 
ment. This we shall do in the next section. 


5.6 The Fundamental Theorem: 
Nonrandom Mating and Variable Fitnesses 


To derive the principle in its most general form, including the effects of 
epistasis and multiple alleles, is beyond the scope of this book. However, we 
shall give some indication of the nature of the extension to more complex 
situations at the end of this section. 

Fisher's (1930, 1958) treatment of the subject is recondite. A clearer 
discussion of the circumstances under which the principle works was given 
in Fisher (1941). For a straightforward, general derivation, see Kimura (1958). 

We shall remove the assumptions of random mating and constant fitness 
for each genotype. However, we shall continue to consider only a single 
locus with two alleles. 

We assign values as follows: 


GENOTYPE A.A, A14; A3 À; 
FREQUENCY P; 2P;i P22 
FITNESS Mi, S7 ay Mi2=At+a12 M22 =G@+ din 
GENIC VALUE Gi, =4+4+ 22, 1270-01-43. G22 =ā+2a: 


We follow the same procedure as was used in Sections 4.1 and 4.2. There 
we measured departure from random-mating proportions by the inbreeding 
coefficient, f. Here we find it more convenient to measure it in another way, 
by a measure 0 to be defined later. Although the inbreeding coefficient is a 
natural measure of the effects of inbreeding, random gene frequency drift, 
and some types of assortative mating, it is not the most natural measure to 
use when relating fitness change to the genic variance. For the rate of increase 
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in fitness to be equal to the genic variance it is necessary, not that f be con- 
stant, but that 0 be constant. 
The phenotypic variance in fitness is 


2 2 
V = P,,aj, + 2P,,a1, + P224},. 5.6.1 


However, we would expect selection to be more closely related to the genic 
variance, as defined in Section 4.1. In this case it is 


V, = P (241)? + 2P,;(x, + 22)? + P55Qa3Y". 5.6.2 


The a's are to be determined by least squares, following the same pro- 
cedures as before. We choose the a's so as to minimize the quantity 


Q = Pi, (m, — Gu) + 2P4;m; — G12}? + P354 m;; — G22)’ 
= Plai — 2a) + 2P;,(a,2 — 8, — az)? + P22(a22 — 2022). 


To minimize Q, we differentiate with respect to a, and æ, and equate 
to 0, giving 


Q 
e = — 4P (a1; — 224) — 4P,2(a1; — 8a, — a2) = 0, 
My 
Q 
= = — 4P,2(a,2 — a, — 42) — 4P22(a22 — 2a2) = 0. 
2 


Rewriting, we obtain 
Pay + P1242 =(Pi1 + P0, + Pii + P1395, 5.6.3 
P1;a,; + P5505; = (P12 + P22)02 + P458, + P2202. 5.6.4 


But, P,,a,, + P12412 =P1a1, where a, is the average excess of the 
allele A,. Likewise P,5a,2 + P22422 = p242. After making these substitu- 
tions in the left sides of the equations above, we then multiply the first by 
a, and the second by a, . Adding the two equations we get 


Pia% + Pr Qa, = 2P at + Pial, + 02)? + 2P,, 03. 
The quantity on the right is half the genic variance (cf. 5.6.2). Therefore 
V, = 2pya,a, + 2p2a, a). 5.6.5 


The extension to multiple alleles and to multiple loci is direct, so we 
can write, in general, 


V, = ayy Pi Gj %;, 5.6.6 


where the inner sum is over the alleles at a locus and the outer is over the loci. 
A comparison of 5.6.5 with 5.5.2, which was derived on the assumption 
of random mating, illustrates the fact that with Hardy-Weinberg ratios there 
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is no distinction between a and a. With inbreeding, a = a(1 +f), where f is 
the inbreeding coefficient. This is clear from the comparison of 4.1.6 with 
4.2.2. The difference between average excess and average effect of a gene is 
discussed in a different way by Fisher (1958, pp. 34-35). 

We now proceed to write an expression for the rate of change in fitness, 


m = PMi + 2P,)m,; + P22M22. 5.6.7 


If the m’s are no longer regarded as constant, 





dm dm dm dm 
de Pag ta R 
4 5.6.8 
dP dP P 
Tm; 3 + 2m, + M52 m 


The first three terms to the right of the equality sign are the average of 
the rate of change of the fitnesses of the individual genotypes weighted by 
the frequency of each genotype. Thus 

















dm;, dm, dm, dm 
P 2P — = —. 5.6.9 
nog tna a a 
Letting m;; — à + a;;, the last three terms in 5.6.8 are 
(aP, dPi; Ta) dP, dP, dP, 
a( ai +2 dt + dt 7, + 2412 ta di 5.6.10 


But, the quantity in parentheses must be 0, since it is equal to the total rate 
of change of the frequency of all genotypes. Because an increase in the pro- 
portion of one genotype must always be accompanied by a compensating 
decrease in the others, the net change is 0. 


We measure the departure from random-mating proportions by the 
quantity 


P, 

Oi; = — > 

Pi P; 

which is 1 for Hardy-Weinberg ratios. Substituting 0;; p, p; for P;; in the 
least-squares equations 5.6.3 and 5.6.4 gives 

pi0, 1411  Pip20150,5 = Pi% + Oy, pit, + O12 Pi P282, 
or 

P1011411 + P2012412 = 0, O1, Pye, + O12 P292, 


and in the same way 5.6.11 


P1912 G12 + P2022422 = %2 + O12 py, + O22 P202. 
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Likewise, since P; = 0;; pi pj, 








dP;, dpi 2 464, 
wr T PaP gr t Rm 

dP dp, dp, dé 
xi = O12 Pi dt — 461 P2 di * + Pip P 
dP, dp; 2 492, 





a = 262) P2 “a ePi di 


Substituting these into 5.6.8 and making use of equations 5.6.9, 5.6.10, 
and 5.6.11 leads to 








dm dm 
AT LO Qs Dupin + Ora paaa) TA 
dp, 
+ 2(a, + 0,5 pi1%, + 922 paa) ede 
, d6,, d6,; den 
+ aupPl—— di l + 2aiiPiPi— d pug pi v 
Now 
hu rogo qp cm = 
and 
dp dp dé da2 
01; Jr +a PE —pi es P2 dt ` p 


This can be shown by noting that p,0,, + p2 9,2 = 1, which on differentiation 
leads to 5.6.13. Substituting 5.6.13 and 5.6.14 into 5.6.12 leads to 


dm dm dpi dp; don 91 
E dt + 2a, — dt + 2a te bev dt +2P,,D "dt 


dp 
+ P2,D), s 


where Di = dij — «;—GQX, 0; Pi Pj = Pijs and $i = log. 0;; (i.e., d0/0 = dd). 
More concisely, 
dm dp, dp; do 


= feet an ee 
dt HV rains pou dt 
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Finally, from 5.3.5, 


dpi 
dt 





= pím,; — m) = pai, 
and since 2 Y' p;a;a; = V,, 


dm dm dọ 
cerunt qr. 6. 
dt g t dt + dt 5.6.15 


Although our derivation has considered only a single locus with two 
alleles, the formula is readily extended as was shown by Kimura (1958). 
The three terms on the right can be interpreted as follows: 


l. V, is the genic variance. 


2. dmjdt is the average rate of change in the fitness of the individual 
genotypes. If these are constant, as is frequently assumed, this term drops 
out. In a natural population the environment is continually deteriorating, 
primarily because of the evolutionary improvement of competing species. 
This term can be thought of as a measure of such deterioration. 

3. The third term measures the effect of gene interaction and departure 
from Hardy-Weinberg ratios. The quantity Dij; is the difference between 
the fitness of a genotype and its best linear estimate. It is thus a measure of 
the effect of dominance. The quantity $ = log? 0 is a measure of departure 
from Hardy-Weinberg proportions, being the log of the ratio of the actual 
frequency of a genotype to its Hardy-Weinberg expectation. 





Therefore, the term Ddd/dt will be 0 if there is no dominance or if the 
genotypes are in Hardy-Weinberg ratios. It will also be true under more 
general conditions. It is not necessary that $ be 0, only that its derivative 
be 0. In other words, there must be a constant amount of departure from 
random proportions, measured by 6. 

Numerical examples where the mean fitness decreases because the third 
term in 5.6.15 (extended to include epistasis) is negative have been given by 
Kojima and Kelleher (1961) and Kimura (1965). 

Notice that when the gene frequencies are changing, constant inbreeding 
coefficient (f) is not the same as constant 4. It is readily verified that when f 
is a constant other than 0, the third term in 5.6.15 is not 0, except in the 
absence of dominance (Crow and Kimura, 1956; Turner, 1967). 

We have mentioned before that, even with random mating, there will 
generally be departures from random genotypic proportions if there is 
selection. The Hardy-Weinberg proportions will be found at the zygote 
stage, but if there is differential mortality this will ordinarily lead to deviations 
from random expectations. We should not expect this to alter greatly the 
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fundamental theorem however. If gene frequency changes are slow, the pro- 
portions of each genotype in the population will be a constant multiple of 
the random-mating zygotic proportions; that is to say, the 0;;s will be 
approximately constants. Thus the condition d log 0/dt = 0 will be approxi- 
mately correct with random mating, so the rate of fitness change is still given 
approximately by the genic variance. 

For discrete generations the formula is similar. With random mating, 
the equation is 


aS 





5.6.16 


+ Aw 


=I|N 


approximately. For a general discussion, see Wright (1955) and Li (1967). 
Somewhat more generally, removing the assumption of random mating, 
we have 


x 


Aw =— +) APSDi + Aw. 5.6.17 


W 
For a derivation of this, see Kempthorne (1957, p. 358). The close relation- 
ship of this to 5.6.15 can be seen by noting that 


d 
DÊ- EPD; © log 0; = 2D Pi 5.6.18 


which is equivalent to the middle right term in 5.6.17 for the continuous case. 

The extension of 5.6.15 to multiple alleles and multiple loci with epistasis 
is not difficult in principle, but involves considerable algebra. It is given by 
Kimura (1958). The equation may be written 

dm dm do 

q WS T 5.6.19 


where e is a measure of the fitness of a genotype, expressed as a deviation 
from the linear least-squares expectation, and therefore is a measure of both 
dominance and epistasis, and $ = log 0 is a measure of the departure from 
random-mating expectations of the frequency of each genotype. 

An explicit expression for this term with two loci with linkage and 
epistasis, but with random mating, is given in the next section. It will be 
shown that with random mating the effects of linkage disequilibrium and 
epistasis effectively cancel, as long as epistasis is weak and linkage not too 
close, so that the last term in 5.6.19 becomes unimportant. 
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There has been considerable discussion in recent years as to whether 
Fisher's theorem is * exact." It is clear from the derivation as we have given 
it that the rate of change in fitness is equal to the genic variance if (1) the 
assumptions of population model 2 apply, that is, equations of the form 
dp/dt = p(m — m) are appropriate, (2) the genotypic fitness coefficients are 
constant, and (3) the departures from random-mating proportions, as 
measured by 0, are constant. 

However, no model can ever be an exact description of nature. We have 
not considered such complexities as arise when one pays careful attention 
to the pattern of deaths at different ages, the various ways in which mating 
combinations occur, different fitnesses of the genotypes in the two sexes, and 
such. We are mainly concerned with rather slow selection in which the details 
are of less influence. Fisher's principle, we believe, summarizes a great deal 
of biological complexity in a simple and elegant statement that relates popu- 
lation genetics to statistical theory in a very useful way. But of course, as a 
description of nature it (like all other descriptions) is only an approximation. 

The effect of overcrowding can be included in the formulation. We let 


the actual rate of increase in a population of size N be given by M, so that 
1 dN 
——=M=m—-wW(N 
N di m — WN) 


where V (N) is a function describing the reduction in population growth rate 
with overcrowding. 


dM dm dy 
di dt dt 
dm  dydN 
"dt dN dt 
5.6.20 
_ dm dw 
^ dt dN 
dm d$ dy 
=V D -MN , 
E rt al dN 


where dy/dN describes the increase in resistance to population growth with 
overcrowding. 
Fisher (1930, 1958, p. 46) writes 


dM y p. M 
di = C 5.6.21 


Our formulation is similar. W is V,, D is our dm/dt, and 1/C is equivalent 
to N(dy//dN), treated as a constant. 
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5.7 The Fundamental Theorem: 
Effects of Linkage and Epistasis 


In most natural populations the assumption of random mating is a good 
approximation to reality. Furthermore, most pairs of genes are unlinked or 
loosely linked. In so far as a single locus is concerned, regardless of the number 
of alleles, the Hardy-Weinberg ratios are attained in a single generation; so 
this is a reasonable assumption. On the other hand, gametic phase equilibrium 
is approached only asymptotically, and with epistatic interactions there is 
permanent ‘‘linkage”’ disequilibrium even for unlinked genes, as was dis- 
cussed in Section 5.4. For a discussion of this problem, see Bodmer and 
Parsons (1962), Kojima and Schaffer (1964), Lewontin (1964), Wright (1965), 
and Felsenstein (1965). 

Therefore, it is necessary to consider the effects of linkage disequilibrium 
when there is epistasis, even when there is random mating. It is not imme- 


diately clear what this does to the term Ddd/dt (or rather, the corresponding 
term in the more general equation when epistasis is included, 5.6.19). 

However, we shall now demonstrate that there is a remarkable property 
of randomly mating populations that is true unless linkage is very close or 
epistasis is very strong. This is that the amount of linkage disequilibrium 
maintained by selection is just enough to nullify the epistatic contribution 
to the variance, so that the genic variance remains the correct measure of 
the rate of change in population fitness. 

We are assuming the same model as in Section 5.4, but we shall repeat 
the basic elements now for convenience. 

Assume that there are two loci, each with two alleles. There are thus 
four gamete types. We shall assume that the loci are linked with recombina- 
tion frequency c. If the loci are on independent chromosomes, then we regard 
this as the special case where c — .5. We assign symbols for the chromosomes, 
frequency, and average fitnesses as follows: 


CHROMOSOME (GAMETE) TYPE ab Ab aB AB 
FREQUENCY pi p2 P3 D^ 
AVERAGE FITNESS mı ma m3 ma 


Fitnesses are measured in Malthusian parameters. For example m, is the 
average fitness of all genotypes containing chromosome ab, weighted by 
their frequency and the number of ab chromosomes carried (1 or 2). 

We make use of the gametic or marginal values in Table 5.7.1. All the 
necessary equations can be expressed in terms of these values because, under 
random mating, the zygotic frequencies are given by the square of the gametic 
frequencies. In this way we effect a considerable saving in algebraic manipu- 
lations. 


218 AN INTRODUCTION TO POPULATION GENETICS THEORY 


We now show that under quasi-linkage equilibrium the Fisher theorem 
is correct. We would expect this to be the case, for quasi-equilibrium provides 
a nearly constant departure from gametic phase balance; in other words, 
the third term on the right side in equation 5.6.19 should disappear. However, 
we shall demonstrate this explicitly to illustrate how the epistatic-variance 
component may be isolated. 


Table 5.7.1. Frequencies and fitnesses of the various zygotic combinations of two 
linked loci. The gametic values are given along the margins, and the zygotic values 
in the main body of the table. 


ab Ab aB AB 
pi pipa Pips Pipa Frequency 
ab : 
Mir mia mia Mia Fitness 
P2Pi pi pipa P2ps Frequency 
Ab : 
mi maa mas maa Fitness 
aB pipi PsPz p3 psp. Frequency 
ms, m32 mas M34 Fitness 
Papi Papa PaPs pa Frequency 
AB : 
ma, maa mas maa Fitness 
Totals pi pai Ps Ps Frequency 
Averages m, m2 ms ma Fitness 
m mca m+B m+a+f Genic value 


m — 2 2 Pipim; = > Pim 
m, = > pm, 
J 
Mij — my 
Pa = pz + Pa, Ps = ps + pa (gene frequencies) 


_ Pipa 
P2ps 





D = pipa — P2 P», Z 


From Table 5.7.1 the mean fitness is 


m=} pp,mi; =} Pi ni 5.7.1 
1 J i 
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and the average fitness of the ith chromosome is 


Y ppm 
j j j = oP p;mi. 


m, = 5.7.2 
Pi 

The rate of change in fitness is 

dm dp; 

m Endo rnt) 

-Y5 y nm + pE Pim ij 
" 2 5.7.3 
= EPI 3. " 
2 T dt ru 
dpi 
=2 Eus 
im dt 

since the two expressions on the right are the same. 

Substituting into 5.7.3 from 5.4.4 and rearranging gives 

ldm 

5yr T È Pim? — M $ pim — cbD(m, — m, — m, + ma). 5.7.4 
But, from 5.7.1 and 5.7.2, 

Y pm =ñ. 5.7.5 
Recalling 5.4.9 and using 5.7.5, 5.7.4 becomes 

1 dm 

5 3T =) p,m? — m? — cbDE. 5.7.6 


However, Y. p, m? — m? is the variance of the m,’s, which we shall call the 
marginal or gametic variance, Vam- Refer to Table 5.7.1. This includes 
components from epistasis, but not from dominance. Thus 


dm 


W^ = 2(V;am — cb DE). 5.7.7 


We now use the familiar least-squares procedure to estimate the additive 
component of the gametic variance. The quantity to be minimized is, from 
Table 5.7.1, 

Q = p,(m, — m? + p:(m, — m — ay! + p,(m, — m — fy 

+ pm,- m —a -— py. BAR 
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Taking derivatives and equating to 0 to minimize gives: 


10 
(1) LCE = pim — m) - palma — m — a) — pm, — m — f) 
m 
— p.(m, — m—a— f) 20 
1 0Q 
(2) sa ce Ee) pm rre mese 
x 
O 3 5 = am- m- B) - pm - m-a- 8) =0. 
Let 
Py(m, — m) = K. 


Then, after subtracting (3) from (1), 
— pim; — m — a) « K. 
Likewise, after subtracting (2) from (1), 
— p3(m3 — m — B) = K. 
Therefore, also 
pa(m4, — m —a — p) = K. 5.7.9 
Dividing these four equations by Pi, P2, P3, and p4, respectively, and 


adding, we obtain 


l 
Hp cap m emek c 5.7.10 
or, using 5.4.9 and 5.4.10, 


E = KP. 5.7.11 
Q, which is the nonadditive component of the marginal or gametic 
variance, is, from 5.7.8 and 5.7.9, 


] 
a ae K?P. 5.7.12 


Thus, V, = V, + K 2P, where V, is the additive component of the gametic 
variance. Substituting into 5.7.7 yields 


pa 
= = XV, + K?P — cbDE) 
E? 
E (v. t—-— cbDE) 5.7.13 
P 
E dZ 
m, SEM 
(v. * PZ =) 
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We showed earlier (Section 5.4) that with loose linkage (cb > |E]) 
Z attains quasi-equilibrium. So, when there is quasi-equilibrium the last 
term in 5.7.13 vanishes. 

We can write 5.7.13 in two ways, recalling that the genic variance V, is 
twice the additive component of the gametic variance, V,, since we are 
assuming random mating, and that 2V,,,, = V, + 3V,, (see 4.1.19). 


dm ny 2E dZ 

"xdi db re 5.7.14 
dm 1 

aE = V,+ ) Via — 2cbDE. 5.7.15 


The first equation is appropriate when there is quasi-equilibrium and 
the second is more useful for strong epistasis and tight linkage, although 
both are correct and in fact fully equivalent. However, under the one cir- 
cumstance the last term of the first equation tends to O and in the other the 
last term in the second equation does. 

We therefore see that with free recombination natural selection operates 
in such a way that just enough gametic phase disequilibrium is generated 
(—cb DE) to cancel exactly the epistatic variance (K?P). Therefore the rate 
of change in fitness is given by the genic component of the variance, as the 
Fisher principle says. 

From this, it would appear that parent-offspring correlations do not 
involve any significant epistatic terms if the population is near quasi-linkage 
equilibrium. In fact, the assumption of quasi-equilibrium is probably closer 
to the truth than the conventional simple assumption of gametic phase 
equilibrium. It is therefore quite possible that the epistatic components of 
variance that are sometimes added to the expressions for parent-offspring 
correlations (e.g., Kempthorne, 1957) may be making the expressions less 
accurate than when only the genic variance is used. 

A numerical example is shown in Table 5.7.2. The slow change in linkage 
disequilibrium, as measured by Z, is evident. Notice how, after the first few 
generations, the change in fitness is given very closely by the genic variance, 
despite great changes in the genic and the total gametic variance. However, 
in the very first generation, the change is given more accurately by the total 
gametic variance (doubled), because the proper level of linkage disequilib- 
rium has not been attained. 

A similar example, but with less recombination, is shown in Figure 5.7.1. 
Again, as soon as quasi-equilibrium is attained, the rate of change in fitness 
is given by the genic variance. 

With very tight linkage the rate of change is given more closely by 


2V,am than by V,. The reason is obvious; with very little crossing over the 


gam 
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Table 5.7.2. The attainment of quasi-equilibrium with random mating and free 
recombination. The genotypic fitnesses are: aa bb, 1.02; aa B-, .985; A- bb, .99; 
A- B-, 1.00. The recombination frequency is 1/2. The population starts in gametic 
phase equilibrium, and disequilibrium is very slowly generated. Discrete generation 
model. (From Kimura, 1965.) 









TWICE 
CHANGE GENIC GAMETIC 
LINKAGE IN VAR- VAR- 
GENER- DISEQUILI- FIT- TANCE  IANCE 
ATION CHROMOSOME FREQUENCIES BRIUM NESS V, 2V un 
AB Ab aB ab zZ x 105 x 105 x105 
0 .200 .200 .300 .300 1.000 2.93 0.73 2.91 
10 .2001 .204 .291 .304 1.028 .66 .66 2.91 
50 .196 .224 .267 .313 1.029 .46 .45 2.85 
100 .186 .243 .244 .328 1.030 .37 .37 2.98 
200 .146 .259 .210 .385 1.035 .93 .92 4.24 
300 .073 .229 .162 .536 1.049 5.72 5.64 10.09 
400 .005 .085 .050 .860 1.081 15.29 15.42 16.78 
500 .000 .006 .003 .991 1.095 1.68 1.72 1.73 
0.00020 - 
0.00018 - ER 
0.00016 - ets, se N 
0.00014 - E Sacr) 
0.00012- y 
0.00010 - Ka 
0.00008 - 
0.00006 
0.00004 
000000 7 TTT 





o 
0 50 100 150 200 250 300 350 400 450 500 550 
H 


0.30 0.32 0.33 0.36 040 046 0.56 0.73 0.89 0.97 0.99 0.999 
D, 


Figure 5.7.1. The genic variance (lower line) and the genic variance 
plus half the additive x additive epistatic variance (upper line). Note 
that the rate of change in fitness, after the attainment of quasi- 
equilibrium, follows almost exactly the lower line. The example is 
the same as Figure 5.4.3. 
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chromosomes behave as units. Notice, though, that in neither case does the 
dominance variance contribute to the rate of change in fitness. 
Table 5.7.3 shows an example of the latter kind. The data are the same 


Table 5.7.3. The same population as was shown in Table 5.4.2, showing that 
with very low recombination and high epistasis the gametic variance (x2) is a 
better predictor of the rate of change in fitness than the genic variance. 


TWICE 
CHANGE IN GENIC GAMETIC 
GENERATION FITNESS VARIANCE VARIANCE 
x 10° x 105 x 105 
0 31.84 .00 31.25 
10 38.76 1.46 41.72 
20 53.56 8.79 61.31 
40 144.62 79.88 168.73 
80 82.82 76.65 99.29 
120 1.41 1.35 1.71 


as in Table 5.4.2. Recombination is low (c — .01) and epistasis 1s large, so 
that E/c is much larger than |. As is apparent in the table, the doubled 
gametic variance is a better predictor than the genic. (The doubling is 
simply because the diploid individual with random mating is derived from 
two independent gametes.) For several generations this is the best predictor, 
but later this turns out to be an overestimate. Notice, though, that there is 
no contribution from the dominance variance to the prediction—it is already 
an overestimate without dominance. 

To return to the more usual situation—after all, most pairs of genes 
are not closely linked—we should not conclude that epistasis can have no 
effect on the rate of fitness change just because this is governed by the genic 
component. Epistasis, by altering the total variability in the population, can 
change all components of the variance, including the genic. This change 
could be in either direction. 

Wright (1965 and many earlier papers) has emphasized that the most 
important kind of epistasis is probably the kind that arises from the fact 
that intermediate values for most metrical traits are optimum from the 
standpoint of fitness. For example, individuals far from the mean for size 
in either direction have lower survival and fertility rates. This would produce, 
generally, epistasis such as to make E negative. Hence the variance would be 
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reduced. This would then make the population slower to respond to direc- 
tional selection; so we can say that in general the effect of epistasis on such 
metrical traits is probably such as to slow somewhat the rate of progress by 
selection, progress being measured by the rate at which the mean for the 
trait is changed. 

We have considered only that part of the total epistasis that is caused by 
interaction of pairs of genes. There are of course possible higher order inter- 
actions. It is generally believed that for quantitative traits these are not very 
important. Indeed, for one particular model, studied by all three pioneers— 
Haldane, Fisher, and Wright—they make no contribution. This is a model in 
which the fitness decreases in proportion to the square of the distance of the 
metrical trait from the mean. Wright (1935) showed that with this model all 
of the epistatic variance is contained in 2-locus interactions and that higher 
order interactions make no contribution. This is not true for most other 
models, but this case suggests that the magnitude of the contribution of 
higher order effects may be small. 

One other possibility is being studied currently by several investigators. 
This is the effect of several linked genes on the chromosome. Even though 
the outer members may be only loosely linked with each other, there may be 
associations because they are both linked with intermediate genes. So far 
there is no general theory for this, although computer simulations suggest 
that there may be such effects, and that departures from gametic phase 
balance may thereby be generated (Lewontin, 1964). 

Two final points connect this section with earlier discussions. One is that 
twice the gametic variance, as we have used it here, includes half the additive 

x additive component of the epistatic variance, V44 (see Section 4.1). 

The other is that, because of inevitable departures from Hardy-Weinberg 
proportions after selection, the genic variance is not exactly the sum of the 
additive portions of the two gametic variances. However, for the same 
reason as given before, if gene frequencies are changing slowly the departure 
from random proportions is roughly constant and the genic variance gives 
the rate of change in fitness. 


A Note on Terminology We have used the word fitness as a synonym for 
the selective or reproductive value of a genotype. It may be either an absolute 
value, measured by the number of progeny per parent, or it may be relative to 
some reference genotype. For a discontinuous model we have used w and for 
a continuous model, m, as measures of fitness. The average fitness, w or im, is 
a function of the individual genotypic fitnesses and the frequencies of the 
various genotypes, and again may be absolute or relative. 

The word is also used more restrictively. In his recent discussions 
Wright (1955, 1969) used the word fitness for that population function that 
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increases with time according to Fisher's Fundamental Theorem of Natural 
Selection. It is not in general the same as log, w or m. As we have seen, 
dm|dt — V, only under certain conditions such as constant individual m's or- 
w's, random mating or constant departure therefrom, loose linkage, and quasi- 
linkage equilibrium. In fact dm/dt may even be negative. 

There is value in defining a quantity which always increases and which 
measures the theoretical evolutionary improvement in a population brought 
about by gene-frequency change, despite the fact that this is hardly ever 
experienced as a corresponding increase in population numbers because of 
such things as competing species, overcrowding, frequency-dependent 
selective values, and changes in the mating system and linkage relations. Such 
a quantity, called fitness by Fisher and the fitness function by Wright, always 
increases at a rate equal to the genic variance. For a discussion of this quantity 
and the way in which it can be defined, see Wright (1969, p. 121). 


5.8 Thresholds and Truncation Selection 
for a Quantitative Trait 


Some situations in nature approach a model in which nothing happens until 
a certain quantity is attained and then all values above this show the phenom- 
enon. For example, it may be that doses of a drug may be harmless up to a 
point and that beyond this point they become harmful. There may be in some 
instances similar kinds of gene action; an effect of some kind appears only 
when a certain number of harmful genes are present. This has been suggested 
by Lerner as the basis for some congenital anomalies in chicks. 

The existence of sharp thresholds in nature is open to discussion, but it 
is clear that this model is approached rather closely in some kinds of breeding 
experiments. All the individuals above a certain level are saved and reproduce; 
the rest are culled. 

Since mass directional selection must be very similar to natural selection, 
we should be able to adapt the theory developed in these chapters to this 
subject. 

We assume that the trait under consideration is determined by a large 
number of factors, genetic and environmental, the number being large enough 
that the effect of any one locus is small relative to the total variability. The 
quantitative trait (e.g., yield, Y) is assumed to be normally distributed (see 
A.5). If the data are not normally distributed they can often be transformed 
to be approximately normal. For example, one might work with the logs, 
or the square roots, or some other function of the original measurements 
that would have a symmetrical distribution approximating the normal. 

The breeder saves for reproduction a certain fraction, S, of the popu- 
lation. All of these lie above the cutoff or truncation point, C, on the abscissa. 
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This is illustrated in Figure 5.8.1. This method of truncation selection is 
equivalent to a threshold at point C. We assume that all the animals or 
plants that are saved are equally fertile, or more generally that there is no 
correlation between fertility and the yield beyond the truncation point. Let z 
be the ordinate of the normal curve at the truncation point. 





Figure 5.8.1. Truncation selection. The 
individuals in the shaded area are saved for 
breeding; the ones to the left are culled. 


Assuming a normal distribution, the frequency f ( Y) of individuals with 
yield Y is given by 


5.8.1 


I (Y- YP} 
f «RP S| 


where c? (=V,) is the total variance in yield from all causes, genetic and 
environmental. 
The proportion saved, S, is related to the cutoff point, C, by 


ES [ f(Y) dY. 5.8.2 
C 


The mean value of the group chosen to be parents is given by 


m a 
y= = Y f(Y) dY. 5.8.3 


Then, the difference between the mean of the selected parents and the popu- 
lation average (called the selection differential, 7) is 


em ades bce eed 
I- Y -Y-g|f Y f(Y)dY — ys 


D. Yf(Y)dy - Y fi IY) ay| 5.8.4 


vl — 


Les E 
-gJ, OX - Yon ax 
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We now let 
Y-Y C—-Y dY 
= y, = C, dy = —. 
o o o 
Then 
o l pm zz» g. I = 
I=- —= | ye? dy =- —— e? 
$72 S 2n 
5.8.5 
o 
——2; 
S 


where z is the ordinate at the cutoff point. 

Common sense tells us that the selection differential, / (or Y'— Y in 
the original units), increases as the selection intensity is increased. But perhaps 
it exaggerates the amount by which the effectiveness of selection, as measured 
by J, increases with a decreasing proportion of individuals selected to be 
parents. Some numerical values are given below. These can be obtained 
from a table of areas and ordinates of the normal integral. 


I 
S ER 

g 
.80 .34 
.50 .80 
.40 97 
.20 1.39 
.10 1.76 
.01 2.66 


Thus, increasing the intensity by 10-fold (i.e., decreasing S from .1 to .01) 
changes / by only 2.66/1.76 or about 50%. 

For example, suppose a plant breeder would like to select a strain for 
resistance to two independent virus diseases. Suppose also that the fertility 
is such that he can afford to discard 99% of the plants each generation, 
saving 1 %. In two generations he could select intensively for one disease the 
first generation, selecting the most resistant 1%, and do the same thing for 
the second disease the second generation. He would thus have a selection 
differential, /, of 2.666 for each disease. On the other hand, he could save 
the most resistant 10% for each disease each generation (thus saving 1 % of 
the plants, as before). He would have a differential of 1.766 for each disease 
each generation, or 3.520 in two generations. Thus he could expect to make 
considerably more progress by selecting simultaneously for the two traits 
than by selecting one at a time. 
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We now want to relate this to the genetic composition of the selected 
individuals. 

Let Y be the mean yield of the population and Y, be the mean yield of 
genotypes carrying A, . The frequency distribution of the yield of these geno- 
types, assuming that the effect of this gene is small relative to the total 
variance, c?, is 











(Y-Y) 
fi(Y) = — exp| - n : 5.8.6 
Let 
Y,-Y Y-Y 
i= 2 ics s 5.8.7 


and note that 
(Y- Y! =Y- Y)- Q - Y)? 
-(Y- YP- XY —- YXY, - Y)«(Y, - Y’. 
Then 


L= LUE ——en|-% eye m 


2 
-f(yrxn»tv e ); 


since y, is assumed to be very small, terms involving higher powers of y, 
can be neglected. 
The proportion saved of genotypes carrying 4, is given by 


— f(Y)exp pv = 5] 5.8.8 





œ © Y =Y 
S= f| AAY =| romar +y, f f(y) ay 
c c c 
I 5.8.9 
o 
(from 5.8.2 and 5.8.4). 
Now the fitness of A, relative to the population as a whole is 
W S, 
W S 
or 
W.-W S,-S I 
Tm--— 5.8.10 
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The average yield, Y, in a randomly mating population is 
Y= piYi + 2pip2 Yi + Pi Y22- 
For small increments, so that products of the Ap’s can be neglected, 


AY = 2(p:¥i1 + p2 Yv32Api + 2(py Yya + P2 Yo2)AP2 


5.8.11 
= 2(Y,Ap, + Y,Apz2), 
where Y; is the average yield of allele 4;. 
From 5.2.1, 
pi(W, — W) p2(W, — W) 
Ap, = EL DE. Ap; = —— ToC 5.8.12 
Substituting into 5.8.12 from 5.8.10 gives 
I I 
Ap, — PUY Ap, = ees, 5.8.13 
c c 


But, recalling that y; — (Y; — Y)/c, substituting 5.8.13 into 5.8.11 yields 


AY M [Pn = Y) ES P2 Y2(%) — 2] 
c c c 


2I v 5.8.14 
ot [p,(Y, — Y)? + p2(¥2 — Y]. 
But Y; — Y=a;, the average excess of the allele A;, so 
ac 2i 
AY — 5 [pai + p203], 5.8.15 


which, from 5.5.2, recalling that with random proportions there is no dif- 
ference between the average excess, a, and the average effect, a, yields 


2 y 
AY-I-, 5.8.16 
V, 
where V, = o°, the total variance. 

V/V, is the heritability, as we have discussed before (Section 4.1). This 
is the familiar formula of the animal or plant breeder: The expected increase 
in yield from selection is the heritability multiplied by the selection differential. 

As a numerical example, suppose the heritability of 6 months weight in 
swine is 0.25. Suppose the herd average is 100 pounds. If we select as parents 
a male who weighs 130 pounds at this age and a female who weighs 110, 
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their average is 120. (In practice the weights would be adjusted for sex dif- 
ferenccs.) Then the progency would be expected to exceed the population 
mean by 0.25(120 — 100) or 5 pounds; so we would expect the litter to 
average 105 pounds. 

Equation 5.8.16 is the same as derived earlier by an entirely different 
procedure (see 4.4.9). We have done it again in this way to show its close 
relation to the Fundamental Theorem of Natural Selection. Whether such 
terms as V44 should be included with V, depends on whether the population 
is in quasi-linkage equilibrium. 


5.9 A Maximum Principle for Natural Selection 


The effectiveness of selection in changing the composition of a population 
depends on the genic variance of the population, as we have shown in the 
previous sections. We can then ask: Are the gene frequency changes such as 
to produce the maximum increase in fitness that is possible within the restric- 
tions imposed by the amount of genic variance? In simple cases the answer 
is yes. Furthermore, we can use this principle to deduce the equations for 
change in the genetic composition of the population. 

In its simplest form the Maximum Principle says: Natural selection acts 
so as to maximize the rate of increase in the average fitness of the population. 
Stated more explicitly: With a given amount of genic variance in a short 
time interval the gene frequencies change in such a way that the increase in 
population average fitness is maximized. 

This principle holds under the same conditions as the Fundamental 
Theorem of Natural Selection. These are: (1) Fitness is measured in Mal- 
thusian parameters; (2) the gene frequencies change continuously such that 
equations of the form of 5.3.5 are applicable; (3) the genotypes are in random 
proportions or the departure therefrom is constant (ie., the rightmost term 
in 5.6.19 vanishes), and (4) the fitness coefficients are constant (i.e., the next 
to right term in 5.6.19 vanishes). 

Consider now a single locus with genotypes in random-mating propor- 
tions. The average fitness (from 5.3.4) is 

m= 2x mi; pip; — > mip,. 5.9.1 

iJ i 
For any set of small changes in gene frequencies, dp,, óp;, etc., the change 
in mean fitness is 


óm — 2. m, p; Op; + 2, mu Bi OD; 
bh I, 


e Y m; Ôp; . 5.9.2 
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To demonstrate the principle we shall show that maximization of ôm 
leads to the correct equations of gene frequency change. This is maximized 
in the usual way, by equating the derivative to 0 subject to the appropriate 
conditions. There are two of these. The first is 


Y. 6p; = 0, 5.9.3 
which follows, since 
Yr = > (yp; + 6 p) = 1. 5.9.4 


The second condition is that the genic variance is (momentarily) con- 
stant. With random proportions the genic variance is 


V, =2> pm, — my 5.9.5 


from 5.5.7 and 5.5.9. Alternatively, 


4V, =) pm} - m. 5.9.6 
From 5.5.11 
1 1 (dp;\* 
=V, =% —|—]|. 9. 
205 Ez) 2d 


So, we write as our second condition 
l 
Y; — (6p;)? = C(6t)?, 5.9.8 
Pi 


where C is a constant, equal to Y' p,(m; — my. 
The most direct procedure for maximizing ôM is by the use of Lagrange 
multipliers (A.10). We let 


1 
y —óm-4 aly. " (êp) — cen? +BY p, 5.9.9 
where A and B are undetermined constants. Equating Ov/O(Óp,) to 0 gives 
A 
2m, + «d op; + B — 0. 5.9.10 
Pi 


We now multiply each term by p; and sum this over all alleles, i= 1,2, ..., k. 
Noting 5.9.1 and 5.9.2 we find B = —2m, and 5.9.10 becomes 


_ Pimi- my 


ôP; — A 


5.9.11 
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Substituting this into 5.9.8 yields 
Y pm; — my = A! C(9ty. 5.9.12 


If we note the definition of C in 5.9.8 we see that 47(6t)? = 1, and 1/4 = ôt. 
The appropriate value is —1/A = ôt (we want the favored gene to increase 
in frequency, not decrease), so, substituting this into 5.9.11, gives 


Op; 
r. = pi(m, — mi), 5.9.13 


which is equivalent to 5.3.5. Thus, the familiar equations for gene frequency 
change follow from the principle of maximum increase in fitness. 

This is reminiscent of the principle of least action and Hamilton's 
principle in classical mechanics, whereby a minimization or maximization 
process leads to Newton's laws of motion. 

The results for a single locus are readily extended to multiple loci if 
there is no epistasis and the loci are in gametic phase (linkage) equilibrium. 
This is discussed by Kimura (1958). 

What if the fitness coefficients are not constant? We have seen earlier 
that under this circumstance the fitness does not change in a simple way and 
in fact may not increase at all. In this case we replace 6m by the value that 
it would have if the fitnesses were constant; that is to say, we forget that the 
m;;s are variable and maximize ôm = 2 5° mjóp,. 

With epistasis and linkage the principle works if the population is in 
quasi-linkage equilibrium. We shall illustrate this with the same model used 
before in Section 5.7. As we did then, we let p,, P2, P3, and p, stand for the 
four gamete (chromosome) types and let Z = p, p4/p; p, be the measure of 
linkage disequilibrium. We state the Maximum Principle as before: For a 
given genic variance, natural selection produces gene frequency changes dp; 
so as to maximize 2 5. mjóp, (or, if the n71;;’s are constant, to maximize the 
change in fitness óm). 

In addition to the two side conditions used earlier, 


» - (Spi) = Vt)? = K(óty 5.9.14 
i 


and 
x 6p; = 0, 5.9.15 


we have the additional condition imposed by the necessity for quasi-equilib- 
rium. This asserts that Z (or log Z) is constant, which is the same as saying 
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that its differential is 0. Thus the third condition is 


1 1 1 1 
ô ln Z = —óp, — —óp, — — dp, + — dp, = 0, 
Pi Pi D; P2 P; Py Pa Da 


or 
k. 
Y, — dp, - 0, 5.9.18 
Pi 
where 
k; = l, 


when i| = l or 4, 
k;— —1, 
when / = 2 or 3. 


Proceeding as before, we let 
1 ] k; 
b= 2Y m öp + A[Y E (py? - KGN?) + BD 5p) + (Xp) 


5.9.17 
where 4, B, and C are undetermined constant multipliers. Differentiating, 


ow 2A k; 
L =2m +—op,;+B+C—=0. 5.9.18 
a(6p;) Pi d Pi 


First, multiply through by k; and add. This gives 
k, 1 
2) 243. Opp CX eso. 5.9.19 


But, the first term is equal to 2E (from 5.4.9), the second is 0 (from 5.9.16), 
and the third is CP (from 5.4.10). Thus 


2E 
Qum. 3. 
P 5.9.20 


Now, multiply 5.9.18 by p; and sum, giving 
2E 


But, since ) dp; = 0 and Y. k; = 0, 


B = -—2m. 5.9.21 
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Substituting for B and C in 5.9.18 gives 
k;E 


m; — m) — —— 
5p. = Pi ) P 5.9.22 
Pi = =A 
The genic variance is 


2E? 
V, =2 >: pm; = in)? = "P 5.9.23 


(see 5.7.13 and earlier equations). Substituting these expressions for dp; and 
V, in the left and center terms of 5.9.14 gives 


1 1 k; E]? N cb. 
E e T NEN a RS Imami =e: 
aE [pdm — mm) — = [Est - m - =] 
Expanding the left side, 
1 


A i Ea S 


Pi 
1 E? 
Aie 


Therefore, 4?(ó1)? = | and, choosing the right sign, 4ót = —1. Equation 
5.9.22 becomes 


k;E 
Op; = a — m) — | ot. 5.9.24 


But, at quasi-equilibrium, E/P = cb D, so we obtain 
Op; = [p:(m; — m) — k,;cbD]6ét. 5.9.25 


Comparison of this with 5.4.4 shows that we have derived the equations for 
the rate of change of chromosome frequencies. 

The principle can be extended to situations where there is not random 
mating and the population is not in quasi-equilibrium, but we shall not do 
so here. For a discussion, see Kimura (1958). For a single locus not in 
Hardy-Wienberg ratios, we state the Maximum Principle as follows: 

Natural selection acts to change the gene frequencies in such a way 
that )' m; 5p; is maximum for a given value of 


" 1 (72) 
V ní(m.— 2 c. EE 
> pm, — m) ù a \ ei 5.9.26 


during a small time interval ôt. 
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The quantity on the left is not quite the same as half the genic variance. 
It is the variance of the average excess of the alleles, whereas the genic 
variance might be called the covariance of the average excess and effect of 
the alleles (cf. 5.6.6). Under some circumstances (for example, in a randomly 
mating population derived from an F, hybrid) fitness may actually decrease 
under natural selection. However, this principle gives the correct formula for 
gene frequency changes. 

The principle (in the less general form first stated) is illustrated in 
Figure 5.9.1. We are assuming two loci, each with two alleles. The frequency 











Figure 5.9.1. An illustration of the Maximum 
Principle. For discussion see the text. 


of allele A, is given by p, and of B, by q,. The changes in gene frequency, 
óp, and óq,, lie on the ellipse given by 5.9.8. The fitness corresponding to 
each gene frequency combination is given by the ordinate so that there is a 
surface of fitnesses corresponding to the various gene frequencies at the two 
loci. The principle states that among the various possible gene frequency 
changes, determined by the ellipse on the plane of p, and q,, the path taken 
by the gene frequency change is the one that leads to the largest increase in 
the ordinate. It is also clear that if the surface has more than one maximum 
the population will tend to move toward the nearest peak and not the one 
that is highest. 
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We have here been using a model that has been extensively developed 
by Sewall Wright. Corresponding to the various allele frequencies at the 
various loci there is an adaptive surface. If the fitnesses are constant and if 
there is equilibrium with respect to linkage and random-mating proportions 
(or a constant departure therefrom, as measured by the 6’s and e's of Section 
5.6), the surface is uniquely determined. With loose linkage a population will 
tend to move rather rapidly to the surface as quasi-equilibrium is attained, 
and then tend to move upward on the surface toward the nearest peak. This 
model applies, obviously, only if there is such a surface. If, for example, the 
fitness coefficients are not constant the surface undulates with time and the 
population fitness does not necessarily increase maximally, or even at all in 
extreme cases. For a discussion of this, two of Wright's papers are (we think) 
especially relevant (Wright 1955, 1967). 


5.10 The Change of Variance with Selection 


If the number of genes influencing a trait or measurement is large the varia- 
tion is essentially continuous, and superficially the pattern of inheritance is 
much the same with Mendelian heredity as with pre-Mendelian blending 
theories. For example, it is quite consistent with both hypotheses for progeny 
to be somewhere near the average of their parents, for selection to change the 
population in proportion to the amount of variability, and for the correlation 
between relatives to be roughly proportional to the amount of their common 
ancestry. In fact, much of natural selection and the progress made in animal 
and plant improvement by selection can be explained equally well on either 
assumption. 

In one respect, though, the Mendelian particulate theory is greatly 
different. With particulate inheritance the variability is lost only very slowly; 
in fact, with an infinitely large population and no selection the variance is 
conserved indefinitely. In a finite population the variance is reduced by 
about 1/2N per generation. When N is large, this is a very slow rate of re- 
duction and a small input of new mutations is sufficient to balance it. 

With blending inheritance the basic nature of heredity is more analogous 
to a fluid than to a series of particles. Just as a mixture of red and white paint 
is forever pink and the red and white cannot be recovered, the variability of a 
population is reduced by each mating between differing types. Suppose that 
in a population of size N the ith individual deviates from the mean by an 
amount x; for whatever quantitative trait is being measured. The variance 
then is (): x2)/N. With blending inheritance the progeny of each mating are 
uniformly the average of the parents. Thus the progeny of a mating between 
two individuals deviating by amounts x; and x; will deviate by (x; + x;)/2. 
The average squared deviation of all progeny will be >} [(x; + xj)/2]^/N, 
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or } (x? + 2x;x, + x3)/4N. But, with random mating x; and x; are uncorre- 
lated, so the middle term has an average value of 0 and drops out. Since there 
is no difference between the sum of the x?'s and the sum of the x7's, the 
variance becomes (Y x7)/2N, just half the value of the previous generation. 
The reduction in variance under random mating with blending inheritance is 
the same as that with self-fertilization in a Mendelian system. 

This means that there must be an enormous source of new variability 
if a blending system is to work. Darwin realized that natural selection requires 
variability in order to be effective. For this reason, and not knowing of 
Mendel's work, he gave considerable thought to what we now realize was 
wholly unnecessary—the devising of various schemes to generate new vari- 
ability. Although inventive and showing Darwin's characteristic thoughtful- 
ness and originality, they seem quite strange to contemporary biologists. 
Now we know that a small mutation rate is quite sufficient to offset the decay 
of variability from random drift in all but very small populations. The subject 
of blending vs. particulate inheritance is discussed in the first chapter of 
Fisher's Genetical Theory of Natural Selection (1930, 1958). We found it 
interesting reading and you probably will also. 

Selection changes the variance. Whether the variance is increased or 
decreased depends on such things as gene frequencies, dominance, epistasis, 
linkage, and the mating system. Natural selection is based on total fitness and 
the effect on any measurable trait depends on the relation between that trait 
and the expectation of surviving and reproducing. For most measurements, 
such as size, the optimum from the standpoint of fitness is intermediate, so 
selection tends to remove extreme deviates. Ordinarily this tends to decrease 
the variance, but not necessarily at every locus since the favored type may be 
heterozygous at some loci. 

If selection proceeds long enough with no input of new variability from 
mutation or migration the genic variability eventually becomes smaller and 
smaller. The gene frequencies change so that either (1) the favorable alleles 
all have a frequency of one, in which case there is no variance, or (2) there 
are stable equilibria, such as with a locus that is overdominant for fitness, at 
which point the genic variance is O (see, for example, the last column in 
Table 4.1.1). So eventually the genic variance becomes 0 for each locus, 
although it may increase temporarily. For additive loci with free recombina- 
tion, the same is true for the combined genic variance from all loci. 

Let us now consider the change of variance in more detail. In particular 
we want to ask what is the rate of change of the genic variance. We can do 
this by the same methods that have been used earlier in this chapter to 
study the change in the mean fitness. 

As in Section 5.6 we shall consider a single locus with two alleles, but 
the formulae will be written so that the extension to multiple alleles is apparent. 
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With Hardy-Weinberg ratios the genic variance in fitness is 
V, = 2[p(m, — my + pom, — iny^]. 5.10.1 


The rate of change of V, is given by 


dV, dp 
E - 2] (m, — my —— 4 (m; — my 2d 
dm, dm dm; | dm 
4 nido 
Lie D di -a + p;(m;— m( rs dí h )|. 
5.10.2 
But, 
dp _, dp as 
m = p,(m, — m), ar = py(m; — m). 
Making these substitutions, 
dV, 
ae = 2[p,(m, — m)?’ + p,(m, — m)*] 
5.10.3 


ed (RTT. 
dt dt dt dt dt dtj dt 


From its original definition (e.g., 5.3.2), 


Mı = MPi + M2 P2, Ma = M42 Py + M22 P2 


and 
dm dp dp dm dp, dp 
o cg pm I SENILIS 
Also 
dp, | dp; 
— + — = 
dt dt i 


since the total change in all alleles must add up to 0. So, making these 
substitutions, 
dV, 


ES = 2[p,(m, — my + p,(m, — m)] 


dp, Y dp, dp dpjV 
+ afma (2) 2m, PEPE cm (2) | 


5.10.4 
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Finally we let m;; — à + a; +a; + Djj, as in Section 5.6. Substituting these 
and simplifying, we find that m,, is replaced by Dii, miz by D,;, and m;; 
by D2 . 

The generalization to multiple alleles is straightforward, so we can write 


d 
Mo = 2» pm; cpm dpi dp; p, 


TTE di di ij: 5.10.5 


This, summed over all loci, gives the rate of change of the genic variance for 
all loci. (We are ignoring the complications of close linkage and epistasis, 
and variable selection coefficients.) 

The quantity D;; is sometimes positive and sometimes negative, de- 
pending on the direction of dominance. With a large number of loci the 
positive and negative values cancel and, unless the direction of dominance 
is correlated with fitness, the average value will be close to 0. The same thing 
can be said for epistatic components that enter when the equation is extended 
to multiple loci. In many cases, probably most, the last term is small enough 
to be neglected. 

The first term on the right side of the equation is the third moment 
about the mean of the distribution. The third moment is conventionally used 
as a measure of skewness. As the number of factors gets large there is a closer 
and closer approximation to the normal distribution, which is symmetrical 
and therefore has a zero value for the third moment. So, the larger the 
number of genes involved in the trait, the less important this term becomes. 

We therefore reach the conclusion that with a large number of loci 
involved (as must always be true for fitness, and often is for artificial selection 
of quantitative traits in livestock), the variance is changed by selection only 
slowly. The population is not likely to exhaust its genic variance unless it is 
small, or the number of loci is small, or selection is very intense. 

Recently, Turner (1969) has emphasized this point that with no input 
of new variability the genic (or with close linkage and epistasis, the gametic) 
variance eventually approaches 0. He suggests that this is the most funda- 
mental statement that can be made about natural selection. In view of the 
very slow rate of change of variance that we have just discussed, we think 
that regarding this as the fundamental statement about selection is debatable, 
but in any case it is one statement that can be made about selection. 


5.11 Selection Between and Within Groups 


Our discussion of selection has dealt entirely with selection among individuals, 
Yet it is obvious that selection occurs at all levels—between individuals, 
between families, between communities, and between species. There may also 
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be intercellular selection within an individual, although we shall not discuss 
this. 

We should expect that the competition would be greatest between 
individuals who are close together geographically and ecologically, for they 
will then be competing most directly for the same resources. Likewise, the 
competition between individuals should be greater the more alike they are, 
for with greater similarity comes greater preference and need for the same 
resources. This gives an explanation for frequency-dependent selection where- 
by a genotype is favored when rare but becomes relatively less fit when 
common. 

On the other hand, selection between similar genotypes has a lessened 
effectiveness in changing gene frequencies because there is less genetic 
variance. This is well illustrated by selection within families. 


Within-family Selection This subject was discussed by Haldane (1924) 
and more recently by King (1965). They have given calculations and formulae 
for a number of special cases that we shall omit. 

Competition between individuals within a family may be increased by 
any of the following: competition between members of a litter (for example, 
a female mouse producing 25 % lethal zygotes often does not have a corre- 
sponding reduction in litter size) ;compensation in uniparous animals whereby 
a fetal or early childhood death leads to the next progeny's coming earlier 
than it would have otherwise, as we shall discuss in Section 6.10 in 
connection with selection on the sex ratio; the practice by animal breeders 
of culling litters down to a uniform size; family planning in humans whereby 
families tend to have the same number of surviving children. 

If all selection is within families and none is between, we can get a 
general idea of what happens with selection as follows. The correlation 
between sibs for additive genetic effects in a randomly mating population 
is 1/2, as we showed in Chapter 4. If V, and V,, are the genic variances between 
and within families, and the total genic variance is V, = V, + V, , the corre- 
lation between members of a family is 





V, 
r= Lay 5.11.1 
as shown in the appendix (A.4.12). So 
L^ =f sR 5.11.2 
V, 


g 


If r = 1/2, then the variance within sibships is just 1/2 that of a randomly 
mating population. So we would expect selection to change fitness or what- 
ever trait is being selected only half as fast. 
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As for the change in gene frequencies, this too is only half as fast. It 
could hardly be otherwise, but here is a demonstration. From the Funda- 
mental Theorem of Natural Selection, 








dm am dp; 
y -——z- ; 11. 
AT 2. op. i 5.11.3 
The genic variance within sibships is 
l 1 — ôm dp: om l dp, 
V, == V, =+ CE EPE RM S Api 5.11.4 


2 * 240p, dt ap,2 dt^ 


Since ðômi/ðp; does not depend on the intensity of selection or rate of gene 
frequency change, the change in the p;'s is only half as fast as with ordinary 
selection. 

This result will apply strictly only when the Fundamental Theorem holds, 
so it is not exact for strong selection with discrete generations. For example, 
rare recessive lethals are eliminated about 2/3 as fast with intrafamily selection 
as otherwise (King, 1965). 


Completely Isolated Groups Somewhat the same thing must be going 
on in nature between populations as happens within populations. Some 
increase in frequency while others decrease, just as genes or chromosomes do 
within a population. Some become extinct; others enter into stable equilibria 
in which several populations coexist. Meanwhile changes are also taking 
place within each of the populations. If we consider a community of organ- 
isms of different competing species, there is a general overall increase in 
fitness (that is, intrinsic fitness, relative to a fixed standard). Such an increase 
is counteracted by crowding, deterioration of the environment, and such, 
just as within a species, so that the total biomass does not change much. But 
it is perhaps worthwhile to formulate a principle like that for within-popu- 
lation selection that states the rate of increase in overall fitness. 

Consider a total population, divided into n subpopulations, not neces- 
sarily of equal size. We assume there is no gene interchange between the 
subpopulations. Let 


M = Y, P, m, = the average fitness of the entire population, 
m, — the average fitness of the kth subpopulation, and 
P, — proportion of all individuals that are in the kth subpopulation. 


Then, 


dP, 


dM dm, o 
u tlg tme 


5.11.5 
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But 
dm, | 
dt ~~ "Gk? 
the genic variance of the kth subpopulation, and 
dP = 
z P. — M 


since m, is the average rate of increase of the kth subpopulation, measured 
in Malthusian parameters (see, for example, 1.6.8a). Also 


> m, P.(m, Ue M) = Y; Pym, — My = Vz, 


where V. is the variance of the mean fitnesses of the subpopulations. 
Substituting these into 5.11.5 gives 


M Vat Va. 5.11.6 
This says: The rate of increase in fitness of a structured population is equal 
to the average genic variance of the subpopulations plus the genotypic 
variance of the means of the subpopulations. 

The effects of nonconstant departures from random-mating proportions, 
variable selection coefficients, and deterioration of the environment can be 
taken into account in the same way as in Section 5.6. 

Since the selection between groups depends on the total genetic variance 
of the group means rather than only the additive portion thereof, this selection 
is somewhat more efficient than that within subpopulations in this respect. 
On the other hand, the group mean differences are not likely to be anything 
near as large as the differences among individuals within the groups. 

The mechanism of intergroup selection is the increase or decrease, and 
eventual fixation or extinction, of whole populations or species. Such extinc- 
tion and replacement by new species split off from existing ones is in no small 
part responsible for the continuing wide diversity of life. But how much of 
the overall increase in adaptability is developed this way is problematical. 
We are impressed by the much larger variance within a group than between 
group means, the larger numbers that minimize the ** noise" from random 
events, and the slower effective ‘‘ generation length ” for intergroup selection. 

The point has been made very forcefully by R. A. Fisher (1930, 1958, 
p. 49). 


The principle of Natural Selection...refers only to the variation among 
individuals (or co-operative communities), and to the progressive modification 
of structure or function only in so far as variations in these are of advantage 
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to the individual, in respect to his chance of death or reproduction. It thus 
affords a rational explanation of structures, reactions and instincts which can 
be recognized as profitable to their individual possessors. It affords no cor- 
responding explanation for any properties of animals or plants which, without 
being individually advantageous, are supposed to be of service to the species 
to which they belong. 

There would, however, besome warrant on historical grounds for saying that 
the term Natural Selection should include not only the selective survival of 
individuals of the same species, but of mutually competing species of the same 
genus or family. The relative unimportance of this as an evolutionary factor 
would seem to follow decisively from the small number of closely related species 
which in fact do come into competition, as compared to the number of indi- 
viduals in the same species ; and from the vastly greater duration of the species 
compared to the individual. Any characters ascribed to interspecific selection 
should of course characterize, not species, but whole genera or families, and 
it may be doubted if it would be possible to point to any such character, with 
the possible exception of sexuality itself, which could be interpreted as evolved 
for the specific rather than for the individual advantage. 


Selection of Cooperative and Social Behavior How, then, do 
characters of benefit to the species evolve? For example, it has often been 
suggested that polymorphism is of benefit to the species, by making it more 
able to tolerate fluctuatingenvironmentalconditions. However, the mechanisms 
maintaining the polymorphism (heterosis, selection depending on frequencies, 
mimicry) are all intrapopulational. So we have no real assurance, solely from 
the existence of the polymorphism, that it is either good or bad for the species 
in addition to its value to the individuals in the population. 

A great many seemingly altruistic behavior patterns may really benefit 
the individual himself. Much cooperative behavior must be mutually beneficial. 

Altruistic, or truly self-sacrificing behavior, is we believe more easily 
explained by ‘kin selection." An extreme example of self-sacrifice is the 
worker bee who is likely to die in the process of stinging a person who 
threatens the hive. This is understandable, of course, in that the worker bee, 
being sterile, has hope of progeny only if the queen with whom the worker 
shares many genes is kept alive and fertile. This means that those genes which 
cause the worker bee to protect the queen will be transmitted dispropor- 
tionately by the queen to future generations and the trait, among workers, 
will increase. In a less extreme way, factors tending to lead a mother to 
protect her children would have similar selective value; for in this way the 
mother enhances the contribution of her genes to the genetic makeup of 
future generations. Since her child shares 50% of her genes, and has more 
of its reproductive life span ahead, it is worth a large risk to the mother to 
keep the child alive; one would expect natural selection to favor such pro- 
tective behavior. In general; and in a very rough way, one might expect 
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natural selection to act in such a way as to perpetuate behavior patterns that 
would lead an individual to take a risk, r, to save the life of another individual, 
where r is the coefficient of relationship between the two individuals, who 
therefore share a fraction r of their genes. 

The structure of the human population in the past, where there were 
many small, rather isolated groups, may have been important in developing 
social behavior. Individuals in a group are likely to be rather closely related 
genetically and this would be an opportunity for the kind of kin selection that 
we have been discussing. The important role that this may have played in 


the development of social systems in man has been emphasized by Muller 
(1967). 


Partially Divided Populations and Wright's Theory of Evolution 
There must be situations, and a great many of them, where the population is 
neither one large unit nor is it completely divided into subgroups. The popu- 
lation may be highly structured with subgroups that are isolated enough to 
prevent random exchange of genes, but between which there are some 
migrants. Such a structure lies at the base of the shifting balance theory of 
evolution that has been discussed extensively by Sewall Wright (1932, and 
many later papers elaborating the idea). 

Wright argues that a partially subdivided population permits some local 
differentiation. This may take place for reasons of different selective forces 
in different localities. It also may happen because the local groups are small 
enough for random events to lead to substantial fluctuations. If, as a result 
of such processes, a particularly happy combination of genes or gene fre- 
quencies emerges, then this can spread to neighboring areas. Thus Wright 
thinks of natural selection as being a mixture of selection within and between 
populations; although he does not distinguish between traits that are of 
individual merit and those that have group merit. 

The opportunity for local differentiation, with random processes playing 
an appreciable role, may help the population to get over a point of unstable 
equilibrium such as was discussed at the end of Section 5.4. In Wright's 
metaphor, the population may have an opportunity to move from one 
adaptive peak to another; something that would not happen in a strictly 
panmictic large population. 


5.12 Haldane's Cost of Natural Selection 


We have given in Section 5.3 formulae that tell what time is required to change 
the gene frequency under different intensities of selection. Haldane (1957) 
was the first to formulate the question of how much total selection is required 
for a gene substitution over the whole period of time involved. It is clear 
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that there is a limit of some sort (although it might be quite elastic) on the 
number of genetic changes that can be carried out simultaneously. For 
example, this point was mentioned in the quotation from Sewall Wright at 
the beginning of Chapter 5. In his 1957 paper Haldane said: “‘In this paper I 
shall try to make quantitative the fairly obvious statement that natural 
selection cannot occur with great intensity for a number of characters at once 
unless they happen to be controlled by the same genes." 

When we examine equations such as 5.3.13-5.3.16 we see that t and s 
are inversely proportional, and that for a particular situation the product st 
is a constant. If the selection is intense, the time is shorter. So, in a vague 
sense, we see that the total amount of selection during the time is constant 
for a given frequency change and type of gene action; or at least st is constant, 
whatever its meaning is. 

Haldane formulated the question in a more meaningful way. Our treat- 
ment is slightly different from Haldane's, but the two lead to the same 
equations. We shall discuss the differences later. 

Consider a single locus in a haploid species and with a discrete generation 
model. We shall extend this to diploids later, but the essential idea is present 
in the haploid situation and is less obscured by algebraic fog. Assume the 
following model: 


GENOTYPE Á A 

FREQUENCY D q—-1-—p "ME u 
FITNESS w, wal—s) ~~ wa(l — sq) 
RELATIVE FITNESS 1 l-s 


A is the favored gene. We assume that it is initially rare, either because it is a 
new mutation that hasn’t occurred before, or, more likely, there has been a 
change in the environment such that a previously deleterious mutant is now 
favorable. 

In the first generation (generation 0) the frequency of the A gene is po. 
This will be represented next generations by w4 descendants. The other 
genotype will be represented by w,(1 — s). Relative to the A genotype, a 
fraction s of the A’ genotype will fail to survive or their reproduction will be 
lowered by an equivalent amount. In the whole population the ratio of 
individuals (or genes) not transmitted to those that are transmitted will be 
Wy Sqo/Ww4(1 — So) = sqo/(1 — sqo). Next generation the proportion of A’ genes 
will be q; 2 qo(1 — s)/(1 — sqo), and so on in successive generations. 

To be concrete, we may think of a population with a reproductive excess, 
but which because of mortality has an adult population that is roughly 
constant from generation to generation. In nature, as we discussed in Chapter 
1, the population size stays stable despite gene frequency changes because its 
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size is determined by other factors (e.g., food limitation). In this population, 
if sq = .2 and ] — sg = .8, the ratio of eliminated to noneliminated is .2 : .8 
or .25. [n other words, if selection is entirely through mortality (and all other 
factors are ignored), each parent would have to produce 1.25 progeny in 
order for the population size to remain stable. 

If the selection is acting on fertility there must be enough variability in 
progeny number so that 


Average number from genotype A — Average of population 
Average of population 


which is equal to 





w 


can, in this case, be .25. 

We shall call the quantity sg/(1 — sq), summed over all the generations 
involved in the gene substitution, the cost of a gene substitution. Designating 
this by C, we have 


Sqr 
C= 2 5.12.1 
2 Ix 541 





The table below gives the value of C for several values of s, assuming 
that the initial frequency of A, po, is .01. 


Po 5 C 
.01 1.00 99 
.01 .99 52 
.01 .50 6.2 
.01 .10 4.8 
.01 .01 4.63 


.01 Limit 4.61 


Three things stand out in this table. First, the values are strikingly high. 
For example, if s = .1, the total cost is 4.8 nonsurvivors per survivor, or in 
terms of numbers of individuals the total number of extinctions is 4.8 times 
the number of survivors in a single generation. This assumes, of course, that 
the number of adults is roughly constant from generation to generation. 
Second, the total cost is less if the selection takes place slowly, i.e., if s is 
small. Third, the value does not change much after s becomes as small as .10 
and hardly at all when s is as small as .01. 

It is not necessary for s to be constant during this process as long as it 
remains small and positive. It is quite likely, of course, that s changes as it is 
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placed in combination with other genes, as the population density changes, 
and as environmental fluctuations occur. This does not make any difference 
unless the sign changes. 

The third point is particularly remarkable. The fact that the cost is 
virtually independent of s when s is small means that we can calculate the 
total cost solely from the initial frequency po. This can be done as follows. 
We assume that the change in gene frequency is slow enough that the addition 
in 5.12.1 can be replaced by integration. Thus, to a satisfactory approxi- 
mation, 











* sq 
C= f, E dt. 5.12.2 
From 5.2.2 
spq — dp 
Ap- zx —- 
= T= sq at 
or 
1 — sq 
dt = dp. 6.12.3 
spg 


Substituting this into 5.12.2 gives 
!d 
C x xL —]n po. 5.12.4 
Po P 


Notice that —In po is 4.61 when po = .01, in agreement with the limiting 
value in the table above. 

Actually, the upper limit should not be exactly 1 because the allele 
frequency will be kept from this value by reverse mutation; but this is a 
negligible correction, as can be seen by replacing the upper limit in the inte- 
gration by a value close to 1. 

Equation 5.12.4 shows that as long as s is small enough for the con- 
tinuous model to be applicable the total load is independent of s and of the 
time required to make the gene substitution, but depends only on the initial 
frequency of the substituted allele, po. 

Here are a few numerical examples. 


Do C= — |n po 
107° 14 
1074 9 
107? 4.6 


10-! 2 


248 AN INTRODUCTION TO POPULATION GENETICS THEORY 


These numbers are all quite large. This means that, if the typical allele 
has an initial frequency of 10 *, a population of 1 million individuals will 
have to have 9 million genetic deaths each generation if it is to substitute 
an average of one allele per generation. Or more probably, if there is to be a 
gene substitution every 100 generations, the average fitness will be lowered 
by .09. 

Notice the dependence of the total load on the initial frequency. Most 
of the cost is in the early generations when the favored gene is still rare. This 
means that genes that are initially common, either because of a high mutation 
rate, or because they were only mildly disadvantageous previously, are the 
easiest to substitute. However, it is probably such genes with only mild 
deleterious effect that are most important in evolution, not only because they 
are initially less rare, but because a gene that is only mildly deleterious in the 
old environment is more likely to be beneficial after an environmental change 
than one that was originally very harmful. 

An alternative formulation that does not depend on s being small has 
been given by Felsenstein (see Crow, 1970). 

The load becomes greater (and the algebra more difficult) when a diploid 
species is considered. The procedure follows the same line as before. We 
assume random mating and let A be a measure of dominance. 


GENOTYPE AA AA, AA, 
FREQUENCY (1— py? 2p(1— p) p 
RELATIVE FITNESS 1—s 1— hs 1 


w =(1 —s)(1 — p? + (Il — As)2p(1 — p) + p? 


5.12.5 
= 1 — s(l — pXl — p + 2hp). 

From 5.2.6 

dt W , "ur 


where w;, the fitness of A,, is [p(1 — pY(1 — As) + p?]/p = 1 — hs(1 — p). 
Substituting this and 5.12.5 into 5.12.6, 


dp sp(1— p)[1— ^ — p(1 — 2h)] 
I mm a $4. 5.12.7 
dt W 


The cost, by analogy with 5.12.2, is 


caf oP at. 


Ww 





5.12.8 
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Substituting from 5.12.5 for w and from 5.12.7 for dt in 5.12.8 leads to 





=f s(1 — p)(1 — p + 2hp) ap 
po SP(1 — p)[1 — h — p(1 — 2h)] 
: j 5.12.9 
= l + h In —— 
[23 n? eres 
when h Æ 1. 
1 
C=- | + In po — -]. 5.12.9a 
Po 
when / = 1. 


Here are some approximate numerical values. These correspond very 
roughly to an initial frequency, po, determined by a mutation rate of 10^? 
and homozygous disadvantage 0.1. 


h Po C 
0 10-* 9 A, dominant 
0.5 107* 18 
0.9 107? 50 
0.99 107? 70 
1.0 107? 100 A dominant 


Thus, in a diploid system, the cost of substituting a moderately rare 
mutant (e.g., one previously maintained by a mutation rate of 1075 to 107%) 
is from 10 to 100. That is to say, the total number of eliminated individuals 
is 10 to 100 times the number of adults in any single generation. Haldane 
suggests 30 as a representative number. 

Haldane also suggests that a species might perhaps devote some 10% 
reproductive excess, or the equivalent in variability, to this process. With this 
cost value, a species could carry out one gene substitution on the average 
every 300 generations if each one involved a cost of 30. If a pair of species 
differ by 1000 loci and 10% reproductive excess were devoted to the process 
(with perfect efficiency), it would have required about 300,000 generations 
for this divergence to take place. 

Our formulation differs from Haldane's in giving the cost as the ratio 
of zygotes eliminated by differential viability and fertility to those not 
eliminated. Haldane gave it as the ratio of those eliminated to all zygotes. 
When s is small these are practically equivalent, so he has the same equations. 
However, our formulation gives a more exact representation for larger values; 
and perhaps the meaning is more transparent. 
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Extensions to this problem to include epistasis and the effects of chance 
have been made (Kimura, 1967; Kimura and Maruyama, 1969). We have 
assumed here a sudden environment change so that a previously deleterious 
gene immediately becomes favorable, as might be the case if a species mi- 
grated to a new locale. On the other hand, the change may be gradual. It can 
be shown (Kimura, 1967) that this makes only a small change in the calcu- 
lations; the general conclusions are essentially unchanged. 

Most organisms have a reproductive rate far in excess of that required 
to maintain the population size in the absence of preadult mortality. Haldane 
was concerned with only that part of mortality and sterility differences that 
were selective. A million-fold reproductive excess, such as is found in some 
plants and invertebrates, is not necessarily more effective in selection than an 
excess of a few percent in which the differences are mainly genetic. Observa- 
tions on the total reproductive rate set only an upper limit on the amount of 
gene substitution that can occur. We have no way, from this measurement 
alone, to determine whether or not Haldane's 10% is a reasonable value of 
the differential fitness that can be applied to evolutionary change. 

According to King and Jukes (1969) the evolution of mammalian struc- 
tural proteins has been at a rate averaging about 1.6 x 107? per year. Let's 
regard this as roughly 10$ per cistron per generation. If the mutant pheno- 
type had an initial frequency of 0.0001 the cost, as we have seen, is some 10-50 
per gene substitution. The cost per generation is then about 10-50 x 1076 
per generation for these proteins. If 10* loci were evolving at this rate the 
cost per generation would be 10% to 50%. 

It is apparent from equations 5.12.9 and 5.12.9a that the cost is dependent 
very strongly on the initial frequency of the favored allele. One way in which 
the cost may be reduced is if the genes start at higher frequencies. Here are 
some representative values of the total cost required for a specified gene 
frequency change. These are for // 2 1/2 in equation 5.12.9. 


GENE FREQUENCY CHANGE TOTAL COST 

FROM TO 

.0001 .9999 18.42 
.001 .999 13.81 
.01 .99 9.19 
l 9 4.39 
2 8 2.77 
3 -7 1.69 
4 6 .81 


This means that if the same phenotypic change can be accomplished by 
changing five genes from 0.4 to 0.6 as by changing one gene from .0001 to 
.9999 (as would seen reasonable), the cost is 5 x 0.81 — 4.05 as opposed to 
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18.42. Sewall Wright has repeatedly suggested that adaptive changes may 
depend more on shifting balances among genes with moderate frequencies 
than on substitution of rare mutants. In Wright's scheme the cost would 
appear to be less in this regard. 

The remarkable property that the cost does not depend on s when the 
value of s is small implies that all gene substitutions involving the same 
dominance and the same initial frequency have the same cost. This suggests 
that if two mutants were closely linked they could both be substituted for 
half the cost of the two if substituted independently. A difficulty, however, 
is that the double mutant is likely to be rare. In simple cases these effects 
exactly compensate. If the single mutant has frequency pg and the double 
mutant p2, then the cost for the linked double mutant is the same as for the 
two independent mutants, since —log p = —2 log pọ. This relation is 
strictly true only when h = 1/2, but it gives the general idea. 

The greatest difficulty in interpreting the Haldane cost principle is un- 
certainty about the importance of gene interaction. The Haldane argument 
that we have given assumes that the genes being substituted are independent 
in inheritance and in action. Sved (1968a) and Maynard Smith (1968) have 
both called attention to the change that is brought about when there is 
truncation selection. The consequences can be quite different. The number 
of loci at which selection can act simultaneously with the same total intensity 
of selection is many times as great. 

Whether selection acts mainly on independent genes or in a threshold 
manner is not clear at present. Certainly some traits are quite independent 
of one another. For example, different genes are known to confer resistance 
to different insecticides or antibiotics. And presumably genes for feathers are 
independent of genes for laying more eggs. On the other hand, if the environ- 
ment changes in such a way as to make increased size adaptive, a number of 
genes would be acting simultaneously on the same trait. Selection is not 
likely to act in a strict threshold manner in nature, but it might approach 
this nearly enough to reduce the cost substantially in comparison with the 
value if each allele were independent. 

There is obviously a limitation somewhere on the number of genes—or 
at least on the number of independent genes—but our present ignorance as 
to the nature of the interactions of the genes concerned with fitness limits 
the practical utility of the principle. Nevertheless, we believe that this is an 
important idea and agree with Haldane's statement that ** quantitative argu- 
ments of the kind here put forward should play a part in all future discussions 
of evolution." He also emphasized that this is only a beginning to a theory 
and added that “I am aware that my conclusions will probably need drastic 
revision." 

For a review, see Crow (1970). 
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The Cost in Terms of Variance We have seen earlier that the rate of 
change in fitness and gene frequencies is related to the genetic variance of 
the population. So we might try a variance analogy to the Haldane cost 
principle. Again, we start with a simple haploid model, as before. 

The genic variance (or since we are dealing with a single locus in a 
haploid, the genotypic variance as well) is pgs? for the model that we have 
just been discussing. Then the analogy to 5.12.2 would give the variance 
cost as 


C= Í pqs? dt. 5.12.10 
0 


This time it is more convenient, and perhaps more realistic, to consider a 
continuous model. Substituting from 5.3.6 gives 


1 
C= | sdp=s. 5.12.11 
0 


So, the total variance required to produce a gene substitution, is the 
selective difference between the two alleles. 

This is easily generalized to the diploid case, and the answer can be 
worked out quite generally by using the principles of Section 5.6. Suppose 
the difference in fitness between the original genotype and the one in which 
the substitution has been made is s; we can write for the total variance cost 


C, = [5 dt. 5.12.12 
0 
But, from the Fundamental Theorem of Natural Selection, 
dm 
uh = Vg: 5.12.13 


Substituting this into 5.12.12 and changing the limits of integration to corre- 
spond to the change in fitness by an amount s, we have 


A+s 
C, =Í dm =s. 5.12.14 
A 


Thus to change the fitness of the population by an amount s every n 
generations demands a genic variance of s/n in the population. The actual 
variance of most populations is quite large, but it is hard to know what 
fraction is genic. 

It would appear, however, that this principle is less restrictive than 
Haldane’s; so Haldane’s principle is likely to be the most useful in estimating 
the rate of gene substitution and comparing this with the actual values as 
they are beginning to be known from amino acid analysis. In any case, it 
appears to be more restrictive for the substitution of initially rare genes. 
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5.13 Problems 


l. 


CA 


How many generations are required to change the homozygous recessive 
phenotype frequency from 0.5 to 0.01 when s = 0.01? When s=0.1? 
Compare the value for s = | from Table 5.3.1 with the exact answer from 
5.1.1. 


. Derive 5.2.7. Note that w=}; piw: — pj W; - 3, j«;D;W; and w, = 


Q isip;wplü — pi) 


. Using 5.2.12 for the 2-allele case, 


EL AC — pi) dw 


A , 
" 2w dp, 


derive 5.2.14 and 5.2.15. 


. Show that the rate of gene frequency change is maximum when the re- 


cessive allele is twice as frequent as its dominant allele. 


. What is the Malthusian parameter of a lethal genotype? 
. Show that with 2 alleles, no dominance, and a continuous model that 


u, = ug + st, where t is time in generations, u is log, [p/(1 — p)], and 2s 
is the selective difference between the two homozygous types. 


. With complete dominance and selection in favor of the dominant allele, 


define u, in terms of p so that the simple linear relation of Problem 6 
applies. 


. Show that with two alleles and constant inbreeding coefficient f the rate 


of gene frequency change is given by 


| p(l — pjdw du j 
Ap — 258 i tu 


where w' is the average fitness of a completely homozygous population. 





. What transformation will linearize the data of Figure 5.2.1, assuming 


that Equation 5.2.27 is appropriate? 


. Fora polygenic trait how fast (approximately) is selection within families 


of half-sibs compared with mass selection and random mating? 


. Show that the rate of elimination of a lethal mutant is about 2/3 as fast 


with intra-family selection as in general for a very rare recessive gene. 


. Show that for a single locus the average excess, a;, of an allele is equal 


to a(1 +f), where «a; is the average effect and f is the inbreeding 
coefficient. 


. Whyis it not generally possible to obtain the genic variance of a partially 


inbred population by measuring the genic variance before the consan- 
guineous matings began and multiply this value by 1 +f? When does 
this procedure give the right answer? 
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14. Show that when the es are constant in Section 8.7 (e, = p,/p,p,, etc.) 


then Z is a constant. 
15. Show that constant 0’s in Section 5.6 do not imply that the inbreeding 
coefficient f is constant, and therefore that when f is constant (except 


f = 0) the fitness increase is not equal to the genic variance. 


6 
POPULATIONS 


IN 
APPROXIMATE 
EQUILIBRIUM 


fraction, of the selection is effective in causing the systematic changes in 

gene frequencies that we think of as evolution. Much differential survival 
and fertility is purely accidental—an animal may survive because it happens 
to be in the right place at the right time. This is especially true of organisms 
that produce a great excess of progeny of which only a few survive to maturity. 

Of the survival and fertility that is not accidental, some is due to purely 
phenotypic differences rather than genetic, such as the effects of an earlier 
environment or an accident of development. Only that part of the selection 
that is based on genotypic, as opposed to environmental, differences has 
any influence on the genetic composition of future generations. 

Much of the genotypic selection goes to eliminating recurrent mutations, 
to eliminating inferior genotypes that arise by segregation and recombination 
from better adapted parents, and to adjusting to changes in the environment. 


[i a natural population only a fraction, and surely only a very small 
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Thus, most genetic selection goes to maintaining the status quo, rather than 
to making progressive evolutionary changes. 

In Chapter 5 we considered systematic changes in gene frequency under 
selection, which might be called the dynamics of evolution. In this chapter we 
consider those factors that maintain the population in an approximate 
equilibrium—evolutionary statics (see Haldane, 19545). 


6.1 Factors Maintaining Gene Frequency Equilibria 


There are many ways in which population variability can be maintained. 
They all depend on some kind of balance between opposing forces, or else 
the variability is transient. Here are some of the most important mechanisms. 
The items on this list are neither mutually exclusive nor collectively exhaustive. 


1. Balance Between Mutation and Selection Each generation new 
mutant genes arise. Most of these are harmful in the environment of their 
occurrence and are eliminated sooner or later by natural selection, if not by 
chance. At any one time there will be a certain number that are not yet 
eliminated. The average number of such mutants will be determined by the 
balance of the mutation rate and the rate of selective elimination. 


2. Selection Favoring Heterozygotes The genotypes of highest fitness 
may not be homozygous, but may be heterozygous at one or more loci. In 
a simple 2-allele situation where the heterozygote is favored it is clear that, 
since the favored genotype has two different alleles, there will be a tendency 
for both of these alleles to remain in the population. The classical example is 
the sickle-cell hemoglobin heterozygote which has neither the anemia of one 
homozygote nor the malaria susceptibility of the other. In complex multiple 
heterozygotes the same general principle applies; the population tends to 
retain more than one allele at each of the relevant loci. 


3. Frequency Dependent Selection Ifthe fitness ofa genotype depends 
on its frequency and on what other genotypes are present—and to a greater 
or lesser extent this must very often be the case in nature—there are many 
ways in which stable equilibria can develop. The most obvious way, and per- 
haps the most important in maintaining variability, is selection in favor of 
rare genotypes. We have already discussed a situation that amounts to this— 
the self-sterility alleles found in some plants (Section 4.12). 

There is one very general consideration that makes it reasonable that 
many genotypes are favored when they are rare. This comes from the fact 
that competition must be most keen between those individuals that are the 
most nearly alike. They will have the most similar food needs, for example, 
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and will therefore compete most directly. To the extent that the genotypes 
differ in their needs a rare genotype will have less competition than a common 
one. Various experiments showing a greater yield or biomass in mixed than 
in homogeneous populations show that this is important, in at least some 
cases. The generality and importance of frequency-dependent selection as 
a factor maintaining population variability is largely unknown. 


4. Heterogeneous Environment If there are different habitats and the 
various genotypes respond to these in different ways, individuals will be more 
favored in some environments than in others. Each type will be more favored 
when there is room in his best environment, but not when he has to live in 
another. This effect will be accentuated if each individual is able to select his 
environment. This leads to a situation in which genotypes are favored while 
they are rare but become disadvantageous when they outgrow their niches. 
This leads to a stable polymorphism for the same reason that the example 
just above does. The conditions under which multiple niches maintain a 
polymorphism have been discussed by Levene (1953) and Prout (1968). 

The borders between the different environments may be sharp, or the 
transition may be gradual. There may be a geographical cline. For example, 
the amount of water might change from one end of the species range to the 
other. Organisms that do better in a dry environment are relatively favored 
at one end of the area and those that do best in a wet environment do better 
at the other end. This may maintain a gradual transition in gene frequencies 
over the range. 


5. Conflicting Gametic and Zygotic Selection, or Meiotic Drive 
If a series of alleles are such that one gamete type is favored, either because 
of selection in the gametic stage (e.g., selection based on the genotype of the 
pollen tube) or because of meiotic events that lead to a segregation bias 
(meiotic drive as defined by Sandler and Novitski, 1957), this can lead to a 
stable polymorphism when the favored type is deleterious in the zygotic 
selection. For example, the t-alleles in mice and the Segregation Distorter 
factor in Drosophila are both maintained at a fairly high frequency in the 
population despite drastic effects on viability and fertility when the genes are 
homozygous. 


6. Neutral Polymorphism If a series of alleles are nearly equivalent in 
their effect on fitness (near enough that the differences among them are of a 
lower order than the mutation rates or the reciprocal of the effective popula- 
tion number) there may be a neutral polymorphism in which the frequencies 
of the different types are determined largely by their mutation rates. In such a 
situation the effects of a finite population become especially important and 
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the number of alleles actually maintained will be determined by the balance 
between mutation and random extinction through gene frequency drift. 
This will be discussed in Section 7.2. 


7. Balance Betweeri Selection and Migration  Migrants that join a 
population from outside will usually be of a different frequency for one or 
more alleles from the local residents. Usually the outsiders will be less well 
adapted to local conditions, so there is a possibility of an equilibrium between 
the local selective forces which tend to favor some genotypes and the influx 
of other genotypes through migration. 


8. Transient Polymorphism |n addition to the various stabilizing 
mechanisms just mentioned, a population may be polymorphic because it is 
in the process of change. We saw in Chapter 5 that it may take many thousands 
of generations for a gene substitution to take place. This means that a transient 
polymorphism may persist for a long time and in nature this may be very 
difficult to distinguish from a stable polymorphism. 


In considering equilibrium problems we usually use a discrete generation 
model. The reason is mainly convenience. We pick whichever model is most 
effective or the simplest, and for most of the questions that we ask in this 
chapter the discrete model is preferable. The conclusions are almost always 
directly transferable from one model to the other, more so than when one is 
considering evolutionary dynamics. As a population gets close to an equili- 
brium point, it makes less difference which model is chosen since the rate of 
change per generation (or other time unit) is ordinarily slowest as the popula- 
tion approaches a stationary value. We are referring here to equilibrium 
values at intermediate gene frequencies: typically these are approached 
asymptotically. 


6.2 Equilibrium Between Selection and Mutation 


Consider a discrete generation model. Each generation is counted at the 
same age; and we shall consider that enumeration is at the zygote stage. 

We assume a single locus with two alleles, and with the three genotypic 
fitnesses and frequencies as follows: 


GENOTYPE A,A1 A,Az A2A2 
FITNESS Wis Wi; =w, (l — hs) | wii =w,,(l— s) 
FREQUENCY Pii 2P;; Pii 


w= P, Wi: T 2PiiWia + P2r2W22. 
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If w,; and w3, are less than w;, (i.e., if h and s are between 0 and 1) the 
A, gene will decrease. This decrease will continue until the loss of A; genes is 
balanced by the rate at which new ones are created by mutation. We let u 
be the mutation rate from A, to 4;. If 4; is rare, as it will be if selection is 
strong, reverse mutants from 4; to 4, will be very rare and can safely be 
ignored. 

One generation later the frequency of allele 4,, in terms of the present 
frequency, will be 


(Pj + w2 P45)(1 — u) 
w 


6.2.1 


(see the derivation of 5.2.1). The factor 1 — u in the numerator represents the 
fraction of A, genes that have not mutated during the current generation and 
are therefore still to be counted as 4;,. 

If the population has reached equilibrium, the allele frequency does 
not change and 6.2.1 can be equated to p,. Thus, 


Pw = (Pay + W2 Pro) — u) 6.2.2 


at equilibrium. 
We now consider several special cases. 


1. A; Completely Recessive Letting q =p, be the frequency of the 
mutant gene and p= p, that of the normal allele, w;, = 1,2, W22 = 
Wi(1 — s), W = wy (Ll — sP25), wy4 Par + Wi2 P4; = wip, and 


u 
P4--. 6.2.3 
M 


We get the entirely reasonable result that the equilibrium frequency of 
recessive homozygotes is the ratio of the mutation rate to the selective 
disadvantage of the homozygous-mutant genotype. 

If mating is at random, P,, = q^ and 


4| 


We need to mention a caution here. Although this formula is the correct 
average for an infinite population, it is not if the population is small. The 
distribution is highly skewed and the mean frequency of recessive genes, if 
these are strongly selected against, can be appreciably less than this formula 
gives even in quite large populations. For example, for a lethal mutant in a 
population of 1000 with a mutation rate of 10° > the frequency of mutants is 
only about one-third its value in an infinite population. The distribution was 


. 6.2.4 


“AIR! 
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first given by Wright (1937) and discussed more recently by Robertson 
(1962) and Nei (1968). This subject is taken up in Section 9.4 (see, for example, 
Figure 9.4.2). 


2. Highly Inbred Population As f/—1, P,,-—p'(1 —f) 4 pf p. 
Py, 7 pq(1 — f) 90, P3; q, W = wy, (1 — qs), and 


P, =q =-=, 6.2.5 
22— d s 


irrespective of /. We expect this, because with f = 1 there are no heterozygotes 
and the value of A is irrelevant. 


3. Random Mating with Complete or Partial Dominance In this 
case Pa =p’, P, pq, P2,=q*, and wW=w,,(1 — 2pghs—q’s). The 
equilibrium equation 6.2.2 becomes Ww = w,,(1 — ghs)(1 — u). 

This leads to the quadratic 


s(1 — 2h)g? + hs(1 + u)q — u = 0, 6.2.6 


whose solution gives the equilibrium value of q. 
If ^ — 1/2, that is to say there is no dominance, then 
2u 2u 


= ——— &-—. 6.2.7 
$ situ) s 


If h = 1, i.e., Ay completely dominant, 
qz-, 6.2.8 
s 


assuming that s > u. The frequency of the mutant phenotype is roughly 2u/s. 
If s can be measured this gives a basis for estimating the mutation rate. 

Many mutants, probably the majority, show some heterozygous expres- 
sion but have a much more severe effect in homozygotes. For example, the 
average newly arisen ''recessive" lethal in Drosophila has about a 5% 
decrease of viability in heterozygotes. So it is important to consider the popula- 
tion consequences when / is in the range 0.01 to 0.1. 

When the frequency of q is determined mainly by selection against heter- 
ozygotes, mutant homozygotes are too rare to be of any consequence and, if 
u is small, 


qs ne 6.2.9 
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When the frequency is determined largely by homozygous deleterious effects, 


. u 
lim q = 6.2.10 
h-0 S 


as we saw earlier for completely recessive mutants. 

Figure 6.2.1 shows the relation graphically for transitional values of A. 
When heterozygous selection dominates, ghs/u = 1. Divergence downward 
from the horizontal line means that homozygous selection is playing a role. 
The graph shows that, for large s and small u, even a small heterozygous 
effect dominates the situation; homozygotes are too rare to have much 
influence statistically. For example, if the mutant is lethal (s = 1) and u = 
.00001, a mutant with >1%, dominance has the same kinetics as if it were 
completely dominant, as the graph shows. 


1.0 


0.8 






0.6 u 
qhs $ = 0.00001 
u 
0.4 = = 0.0001 
0.2 


O 0.05 010 0.15 020 025 0.30 0.35 040 0.45 0.50 
h 

Figure 6.2.1. The equilibrium value of ghs/u as a function of A, where g 
is the mutant-gene frequency, s is its homozygous disadvantage, hs its 
heterozygous disadvantage, and « the mutation rate. When most of the 
elimination of mutants is by selection against heterozygotes, ghs/u— | 
and the values lie along the horizontal line. If eliminations are partly 
through homozygotes the curve dips down and becomes 0 if all elimina- 
tions are this way. 


Furthermore, there is strong evidence in Drosophila that A is larger for 
mutants with small s than when s is large (for a summary and references, 
see Crow, 1970). In general, the best evidence is that almost all the impact 
of natural selection on mutant genes is through their heterozygous effects. 

If h is large enough that most selection is in the heterozygous state, then 
the difficulty in finite populations discussed above is no longer so important. 
With partial dominance the mean frequency of mutants is almost independent 
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of the effective population number, although the variance is much greater 
in small populations of course. 


4. Nonrandom Mating, Partial Dominance In this case, let P,, = 
p*(1 — f) + pf, Py2 = pq — f), and P22 =9°(1 — f) + qf. If h, f, and u are 
all small enough that products of three of them can be neglected, the solution 
of 6.2.2 becomes 


u 
15 rft 


If f= 0, we have the situation just discussed. If f> h then homozygous 
selection is most important. If h > f (unless both are very small, or s is very 
small) most of the selection is in heterozygotes. 

Reverse mutation is not likely to be important for two reasons. One is 
that the rate is usually less. The other is that the mutant gene, especially if As 
is large, is very rare so there are few opportunities for reverse mutation. 


6.2.11 


X-linked Locus The case of primary interest is that where the mutant is 
deleterious and recessive. In contrast to the situation for autosomal reces- 
sives where a small amount of heterozygous selection may be the most import- 
ant effect, a small amount of heterozygous expression has only a slight effect 
on the frequency of an X-linked recessive. The reason is that a single recessive 
gene is expressed in males and thereby exposed to the full effect of selection. 

If the fitness of mutant males is 1 — s, relative to normal males, and 
heterozygous females are not greatly impaired, then the rate of elimination 
of mutant genes per generation is about 5/3, since one-third of the X chromo- 
somes are in males. If this is balanced by new mutations, u œ qs/3 or 


3u 
s 


6.2.12 


at equilibrium. As expected, the frequency is higher than for autosomal 
dominants, but much less than for complete recessives. 

If s is large the selective value of homozygous females is almost totally 
irrelevant as far as determination of gene frequencies is concerned. Very few 
such females exist; with a lethal there would be almost none at all. Likewise, 
the selection against heterozygous females is unimportant unless it gets to be 
comparable to s in magnitude. 


6.3 Equilibrium Under Mutation Pressure 


Ordinarily one regards selection as the strongest force influencing gene 
frequencies, with mutation providing a steady input of new variability. On 
the other hand, there is growing evidence that some, perhaps a large fraction, 
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of DNA changes are nearly enough neutral that mutation rates become an 
appreciable factor. With such weak forces we should expect the consequences 
of random gene frequency drift to be important. That we shall consider later 
in Section 7.2. In this section we shall treat the subject deterministically, as if 
the population were infinite. 

Consider a series of n alleles A,, 42, 45, ..., An. The decrement in 
frequency of A; due to mutation from 4, to other alleles is 


(2 Mies 


where u;; means the rate per generation of mutation from A; to A; and p;, 
as usual, is the frequency of A;. The increment jn A; alleles will be the sum of 
the mutation rates from other alleles to A4; , or 


2 uji Pj 
J 
Thus the net increase in frequency of 4; per generation is 
Ap; — — pi 2: uj; + 2: Uji pj, 6.3.1 
J J 


as given by Wright (1949, for example). 
The equilibrium is obtained by setting Ap; = 0 for each value of i and 
solving the series of simultaneous equations. For two alleles, p; = 1 — p, and 
we have 
Ap, = —pyt,2 + (1 — py)uay, 
and when Ap, — 0, 
| Un 0 ? 
uztu UD’ 





bi 6.3.2 


where u and v are the mutation rates from and to allele A,. 
For three alleles we have the equilibrium equations 


— (u12 + 43)p1  21p2 + 3 p3 = O, 
Uy2P, — (u51 + u53)p2 + U32P3 = 0, 6:3:3 
u13 Py + M32 p2 — (U31 + ú32)P3 = O. 
These three equations are not independent, since p, + p; + p, = l. We can 
reduce this to two independent equations by replacing p, by 1 — p, — p; in 


two of the equations and solving. A convenient way of getting the solution is 
to write the matrix of the coefficients 


— (u2 + 143) u21 KESI 
uy2 —(u2, + u23) Uu42 =A. 6.3.4 
U3 u23 — (u3; + u32) 
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Then 

poe 
1 D , 

5; 

= ; 6.3.5 

Pa D 

P LIE 
3 D 5 


where A,, is the 2 x 2 detriment obtained by deleting row 1 and column | 
from A. Likewise A;; is gotten by deleting the second row and column, and 
so on. D, the denominator, is the sum of the three numerators. D is also the 
value of the determinant obtained by replacing any row in A by l’s. (Some 
simple rules for evaluation of determinants are given in A.7.) 

The extension to four or more alleles is straightforward. The determinant 
is larger, but the rules are the same. 

With typically low mutation rates the approach to equilibrium is very 
slow. For example, with two alleles, 


Ap, = —u3p4 t Ui pa 6.3.6 
Now let 

Pi 3 Pi + Č, 

P2 = Î2 — Š, 
where f, and f, are equilibrium values. Then substituting into 6.3.6 we get 

Ag = Ap, = —(u5 + u3))6 — 162 fy + Ur Po. 
But the last two terms add up to 0 (see 6.3.2), so 

AG 


-> =u, +u =} U. 6.3.7 


¢ 
The approach to equilibrium is at a rate equal to the total mutation rate. 
f} u= 1075, about 70,000 generations are required to go halfway to equi- 


librium. So, to nobody's surprise, we learn that the approach to mutational 
equilibrium is a very slow process. 


6.4 Mutation and Selection with Multiple Alleles 


When discussing the equilibrium between mutation and selection we con- 
sidered only two alleles and ignored reverse mutation. It is expected that the 
wild-type gene can mutate to many different states, as we have discussed in the 
previous section. If the mutant states are individually rare and are selected 
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mainly as heterozygotes then there will be little chance for interaction, since 
hardly ever will more than one of a set of mutant alleles occur in the same 
individual. The total frequency will be governed mainly by the total mutation 
rate from the normal to all mutant alleles. 

Experience with Drosophila mutants shows that the combinations of two 
recessive mutants with visible effects are often intermediate between the two 
homozygous mutants. In this case there will be little interaction. So we suspect 
that in most cases multiple allelism doesn’t change the picture much. 

We shall consider the question only briefly. For this it will be expedient to 
adopt a continuous model and assume that the gene frequency change can be 
written in the form 


d 
i = pm, SmiS Pa X apre 6.4.1 
We get this by combining 5.3.5 with 6.3.1. 

For reasons mentioned in Section 6.2 reverse mutation can be ignored 
with little loss of accuracy if the deleterious effects of the mutants are large 
relative to the mutation rates. If the last term is 0, by setting dp;/dt = 0 for 
equilibrium we obtain the pleasingly simple result 


= Y Uo js 6.4.2 
J 


where the subscript 0 designates the normal allele (which, of course, may be a 
population of indistinguishable isoalleles) and the j indicates any mutant 
allele. 

In words: At equilibrium the average excess in fitness of the wild-type 
allele is equal to its total rate of mutation to all mutant alleles. 

This depends on the fitness being measured in Malthusian parameters. 
It also assumes that reverse mutation to the wild-type allele can be neglected. 
However, no restriction is placed on the number of alleles, on the mating 
system, or on the rate of mutation from one to another mutant allele. We shall 
use this principle to find the equilibrium frequencies of mutant genes. 


1. Fully Recessive Mutants We place no restriction on the interaction 
between mutant alleles, but all are recessive to the wild type, A,. For algebraic 
simplicity we set the fitness of the wild type at 0, measured in Malthusian 
parameters. Let the fitness of the mutant type 4; 4; be —5;;. Then from 6.42, 


o- m= È Pusu = 2, Mor 6.4.3 


where the summation extends over the n — | mutant alleles. The average 
reduction in fitness caused by mutant phenotypes (homozygous mutants and 
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combinations of two mutants) is 


um 4 zu. 6.4.4 
ij 


from which 


se 





$ 


$ Pj= 2 Ln 6.4.5 


(his says that the total frequency of mutant phenotypes is the ratio of the 
total mutation rate (forward) divided by the (weighted) average selective dis- 
advantage of the mutant phenotypes. The fitness is measured in Malthusian 
parameters. 

For two alleles the discrete generation analogy is 6.2.3. 


2. Partially or Completely Dominant Mutants Once again we let the 
wild-type (4o Ao) fitness be 0. The fitness of the mutant homozygote 4; A; 
is —5;;, of the mutant heterozygote A; A, is —5;;, and of the normal-mutant 
heterozygote Áo Á; is — h; su. From 6.4.2, 


h.s.. Pa. 
=) eu 9 — m -Y ug. 6.4.6 
i Po i 
But, 


—-ñm=2 X h, s;; Po; + 23 Pi, Siz- 
i ij 


where, as before, the summation is over all mutant alleles. If there is appreci- 
able dominance the last term is very small relative to the others. Omitting 
this, and noting that 


X PoihiSii 


S 
X Po, hs 


we obtain 











2 Uoi Do Y Uoi 
Pops == x x = x ; 6.4.7 
2 = hs 2Py — 1 hs "n 


With Hardy-Weinberg proportions Po; = pop; , and 


6.4.8 


Yogi. 


hs 
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So, as a first approximation based on strong selection against heterozygotes, 
the total frequency of mutant alleles is the total forward mutation rate 
divided by the average selection against heterozygotes. 

With weaker selection the situation is more complex, and we shall go 
no further with it. 


6.5 Selection and Migration 


The subject of migration and population structure has become very extensive 
in recent years. Various models of structure ranging from a continuous 
population with limited dispersal to completely isolated islands have been 
studied. We shall consider only one model, the island model of Sewall Wright 
(see, for example, his 1951 paper). Later, in Sections 9.2 and 9.9, this and other 
models will be treated stochastically. 

The species is thought of as broken into a number of subpopulations, 
largely isolated from each other, but with some exchange of migrants. We 
let p; be the frequency of an allele of interest in the ith subpopulation, and 
p be the frequency in the whole population. In Wright's model a fraction M 
of the population are replaced by migrants each generation. The migrants are 
assumed to have a gene frequency equal to that for the whole population p. 

The change in allele frequency in the ith subpopulation per generation is 


Ap; = —Mp; + Mp = — M(pi — p). 6.5.1 


Since on this model the effect of migration is linear in the gene frequencies, 
there should be a correspondence with mutation which is also linear. If we 
rewrite 6.5.1, 


Ap; = — Mp: + Mp — Mp: P + Mp: P 


z z 6.5.2 

= —M(1 - Dpi + MPC — pj), 

we see that this corresponds in form to 
Ap, = —ijapy + uz (l — py). 6.5.3 


So we can carry the same equations from mutation to migration by setting 
M(1 — P) = u2 = u, 


E 6.5.4 
Mp = u21 = V. 


We shall use this correspondence in Section 9.2. 

In analogy with 6.4.1 we may regard selection and migration both as 
continuous processes and write 

dp; 


— = p(m. — m) — M(p.— p. 6.5.5 
di pm; — m) (p: — p) 5 
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We must keep straight that m is the average fitness of the allele of interest in 
the ith subpopulation whereas p is the average allele frequency in the entire 
population. We are using m for the Malthusian parameter of selection and 
M for migration. 

It is to be expected that the immigrants coming into a subpopulation from 
the outside will be less well adapted to the local conditions since they have 
not had the benefit of previous selection in the local environment. This is quite 
analogous to an input of different genes by mutation (although the migrant 
genes have been pretested in an environment that is not wholly unrelated, 
and hence are likely to be less deleterious than new mutants). 

To find equilibrium conditions we can equate 6.5.5 to 0. In general the 
selection term will be a cubic function of the gene frequencies. In the absence 
of dominance the expression is only quadratic, so we shall consider this. 
Using 5.3.8 and equating dp/dt to 0 we have at equilibrium 


sp(1 — p) - M(p — p) = O, 6.5.6 
where, dropping the subscripts, p is the frequency of the gene (say A) that is 


locally favored, s is the selective advantage of AA, and s/2 that of Aa, both 
measured in Malthusian parameters. The relevant solution is 


po 37 M + VM = 3  AMsp 
E 2s l 


For example, if M = s, p = mi P. 

It is clear that if migration is large and selection weak all the populations 
will tend to become alike. On the other hand, if |s| » M and the selective 
values differ from one subpopulation to another, there will be considerable 
local differentiation. This model has been extensively discussed by Wright 
(1940, 1951). 


6.5.7 


6.6 Equilibrium Between Migration and Random Drift 


If the population is broken up into subpopulations and these subpopulations 
are small there will be random drift of gene frequencies among the sub- 
populations so that they will drift apart. Migration from one to another will 
counteract this effect. We shall discuss this briefly now by elementary pro- 
cedures and then return to it in Section 9.2 with a more sophisticated stochas- 
tic treatment. 

As in the last section we let M be the amount of exchange each generation 
by migration. From equation 3.11.1 the increase in autozygosity in a sub- 
population is given by 


1 1 
= — + 1-— =r 6. 
f 2N. ( F) ! net 
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where N, is the effective number in the subpopulation. However, the genes 
will remain identical only if the individuals carrying them have not been 
replaced by migrants. The probability that neither of the two uniting genes 
has been exchanged for a migrant gene is (1 — M}. (This is not quite exact 
because our model assumes that self-fertilization is possible, but unless N, 
is very small this is a trivial correction.) 

The equation, including a correction for random exchange of genes with 
outsiders, is 


i= lsat ( - acp - ar 6.6.2 
Letting f, = f,.., =f for equilibrium, 
f- EDEN - My : 6.6.3 
2N, — QN, —1)(1 — M)! 
When M is small so that M? can be neglected, 
f po : 6.6.4 


“4N.M+1—2M AN,M Y 


We shall find exactly the same formula for mutation in Section 7.2. 
This is not surprising because of the mathematical equivalence of mutation 
and migration in Wright's migration model. 

If M « 1/AN,, then f becomes large and there is considerable local 
autozygosity. Contrariwise, if M > 1/4N,, the migration swamps the local 
subpopulation and the whole thing becomes effectively one panmictic unit. 

It is impressive that the amount of migration needn’t be very great. 
A fraction 1/N, of the population means one individual in a population of 
N,, so if the number of migrants is much more than one per generation, there 
is little local differentiation. This is mitigated somewhat, however, because 
migrants tend to come from neighboring subpopulations, rather than being 
a random sample of the entire population, and neighboring genes are likely 
to be somewhat alike. Hence the swamping effect of migration may be less. 

Equation 3.12.3 connects the inbreeding coefficient of a subpopulation 
with the variance in gene frequency caused by random variation among the 
subpopulations. From this 


V 
P 6.6.5 


{= 9" 


where V, is the variance in gene frequency among subpopulations and p 
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is the average frequency in the whole population. Substituting this into 
6.6.4 gives 


E — p) 
y, = —————— 6.6. 
PT AN,M +1" à 
which is Wright's formula relating the amount of random differentiation 
among partially isolated subpopulations to the amount of migration. 
We shall derive the same formula again later (9.2.10) by a quite different 
procedure. 


6.7 Equilibrium Under Selection: 
Single Locus with Two Alleles 


We start with a simple case—one locus, random mating, and two alleles. 
We shall use a discrete generation model; in general the conclusions are the 
same with a continuous model. 

Suppose the generations are enumerated as zygotes and that the fitnesses 
of the three genotypes 4,4,, 4,4;, and A, A, are w,,, w,;, and w3,. With 
random mating the zygote frequencies, counted before selection, will be 
P1, 2p, p; , and p} , where p, and p; are the allele frequencies and p, + p; = 1. 

From 5.2.13, 


_ PiP2 [pir — 3) + Pawi — W22)] 


Ap, — 6.7.1 
Ww 
Equating this to 0 to find the equilibrium values gives 
Wo2—-W 
i, = Mi 6.72 


? 
Wii — 22 + 25 


in addition to the two trivial values p, = 0 and p, = 1. 

We can notice immediately that some restrictions are imposed by the 
fact that p, must lie in the range O to 1 and therefore cannot be negative. If 
we write the denominator as (wi; — w12) + (22 — w,2) we see that O < p, < | 
only if both terms in the denominator have the same sign. So we see that one 
condition for an equilibrium is 


Wii < W12 > W22 
Or 

Wi; > Wiz X W22. 6.7.3 
To have an equilibrium other than 0 or 1 the heterozygote must either be less 


fit or more fit than either homozygote. Otherwise the only equilibria are 
p; —500rp,- I. 
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To investigate the stability of the equilibrium consider a small displace- 
ment č from the equilibrium f, so p, = f, + č. Then p; must be f, — č and, 
noting that Ap, = 0, 


Ag = Ap, — Af, 
= Ap, © fifa (bi + Ov, — wi) + (P2 — OQ, — W32)]/w 
= pibi(w;, — 2wi2 + w22)E/w, 6.7.4 
since 
By, — 12) + B2(w12 — W22) = O. 


When w,, — 2w,5 + w22 < 0, AG/E is negative, and its absolute value is 
less than one. This means that f, is a stable equilibrium in that a displace- 
ment from this point is followed by a tendency to return to the point. 

When w,, — 2w,2 + w22 > 0, then A¢/¢ is positive. A displacement is 
followed by an even larger displacement in the same direction, so the equilib- 
rium is unstable. 

Selection favoring the heterozygote leads to a stable equilibrium, 
with the equilibrium value of p, given by 6.7.2. If the heterozygote is selected 
against, the population tends to move away from the unstable equilibrium 
toward p, = 0 or p, = |. Whether the A, or the A, allele is fixed depends on 
which side of the equilibrium point the population starts from. 

A locus with a superior heterozygote tends to persist in the polymorphic 
state so that it contributes quite disproportionately to the variability of the 
population. A locus with an inferior heterozygote ultimately is fixed at gene 
frequency 0 or | and does not contribute to the variability. 

There is another way of testing the stability of an equilibrium. We saw in 
Section 5.5 that under natural selection with a single locus, random mating, 
and constant fitness coefficients the fitness always increases. With a continuous 
model this follows from the Fundamental Theorem of Natural Selection. 
With a discontinuous model it follows from 5.5.14. 

The mean fitness is 


w= Pi"u + 2p,Pp2Wi2 + pi W22; 6.7.5 


from which, recalling that p; = 1 — pi, 


dw 
dp, = 2pi(vi1 — w12) + 2p3(w21 — W22) = 0, 6.7.6 
1 


leading to the same equation as before, 


W22 — W12 


re ., 
Wi, — 2W12 + W22 
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So, as expected, the equilibrium point is where the fitness is stationary— 
either maximum or minimum. 

Differentiating again, 

TN 

e = 2(w,, — 2w42 + w22). 6.7.7 
When this is negative the equilibrium point is a relative maximum and there- 
fore stable. When the second derivative is positive the equilibrium is unstable. 

When there are multiple alleles the conditions for a maximum are more 
involved, as we shall see in the next section. 

For a particularly lucid elementary exposition of the various kinds of 
equilibria under selection, see Li (19676). 


6.8 Selective Equilibrium with Multiple Alleles 


Again we assume random mating and a discrete model. There are 7 alleles 
Ai, 42, ..., A, with frequencies pı, p2, ..., p,. Let w;; be the fitness of 
genotype 4; 4;. Then, from 5.2.1, the change in frequency of allele A; in 
one generation is 


Api SPEO). 6.8.1 
wW 
where 
W= c PiP; Wij 
if 
and 
V = Y Pj Wi; 


At equilibrium when Ap; = 0, 
w = W, 6.8.2 


and we reach the quite reasonable conclusion that at equilibrium the average 
fitness of all alleles is the same and equal to the population average fitness. 
If this were not true, those alleles with the higher fitnesses would increase 
and the population would not be in equilibrium. 

To investigate the stability of the equilibrium we should like to use the 
principle employed in the preceding section—that fitness is maximized at the 
point of stable equilibrium. For the continuous model we know this from 5.5.9, 
which extends readily to multiple alleles. We also know from 5.5.14 that this is 
true with a discrete generation model for random mating with two alleles. 
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We would expect this to be true for multiple alleles also, and this has 
been shown. For a discussion of this problem, see Scheuer and Mandel 
(1959), Mulholland and Smith (1959), and Kingman (1961, 1961a). We shall 
now demonstrate this. If you are willing to accept this without proof, you 
may wish to skip this section and proceed directly to equation 6.8.11. 


Demonstration That Fitness Always Increases in the Neighbor- 
hood of the Equilibrium If we use a caret to designate equilibrium 
values, 


v = Y wis Bj = 9 =D) wi; Pi B;- 6.8.3 
j ij 
To simplify the calculations, we express the fitnesses relative to the mean 
fitness at equilibrium, so we let 


MA 


ij ĝ 7 
In this measure 


Ñ=}, W; 8; = W - 1. 6.8.4 


The formula for Ap; is the same in the new units; 


2 p us 


Our interest now is in the behavior of W near the equilibrium point. 
Let č; be a small deviation from f, as in the last section, so that 


Pi = Pit Ši (Y €; = 0). 


The change in W due to these displacements is 


à8W-y Wf; + EN; + 6) - 1 


Api 


E 2). W;;p;6, + » W;;6;6;. 
But since 
» Wip;6-— 2, č; 0, 
we have 


àaW-y W, čič; 6.8.6 
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Similarly, if we consider the change in W; due to these displacements, 


oW, 72 Wi; + $) — D ub; 
=) Wb; =, 
j 


6.8.7 


so 
OW, = e£. 6.8.8 


The change in gene frequencies in one generation near the equilibrium 
point, from 6.8.5, 6.8.6, and 6.8.8 is approximately 


Ač; = Îi Ei . 6.8.9 
The change in mean fitness W in one generation due to natural selection is 
AW= 2. W, (pi + ApX(p; + Ap) — 2, Wi bip; 
ij 
= 25, W, ^p; +}, Wi; Ap; Ap;, 
i ij 
which, in the neighborhood of the equilibrium point, reduces to 
AW 2 2Y (WF, + 5W,) AE; + > Wi; A6; Ag; 
i ij 
-2» fici + >, Wij BiB; ee), 
i ij 


from 6.8.4, 6.8.8, and 6.8.9. 
Now consider two quantities, 


A=2) fie 
and 

B= 3 Wi; pie, + 6), 
both of which are nonnegative. Since 

B= 22:9 jPiPj 6 Bea ad ij Bi Pj Ei £j, 
then 


LW ij Di Pj £i t; = W;; 


> 
> 


i 


W, pc? 


x " 


i 


NI ud wis 


(B — A), 


since W, — 1. 
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Using these quantities, the change of W in one generation near the 
equilibrium is 


AW = A - 4(B— A) - (4 B) 2 0. 6.8.10 


This shows for multiple alleles, as 5.5.14 did for two alleles, that (with 
random mating and constant w,,’s) the fitness always increases unless the 
e; sare all 0, that is to say, unless the population is already at equilibrium. 

We now use this principle to derive stability conditions for the multiple 
allele case. 


Equilibrium Allele Frequencies The gene frequencies at equilibrium 
may be obtained directly by solving the simultaneous equations from 6.8.2 
using the definitions of the w;'s (see 5.2.5). 


Wiipi; + Wi2Pp2 °t + WinPn =W 
W2ipi t W22P2 t °°* + wp, = W 6.8.11 


WniPi + Wa2 P2 ae ae Wan Dn = w 


The solutions, from Cramer's rule, are 


p= s 6.8.12 
S A » A; 
where 
Map. 19032 Win 
Ww w w 
^ £o CORE 2n 6.8.13 
Way Wn2 VES Wan 


and A; is the determinant gotton by substituting l's for all the elements in 
the ith column of the determinant A. The latter part of 6.8.12 follows from the 
fact that Y' p; = 1. The average fitness at equilibrium is # = A/)' A;. 


Stability of the Equilibrium We have seen that fitness always increases 
except at the equilibrium, so the stability condition is that w is a relative maxi- 
mum. This means that ów must always be positive for small deviations from 
the equilibrium. Mathematically this is assured if the quadratic form 6.8.6 
is negative definite (see below for definition). 

Since Y. č; = 0, then 


Ča = -X& 6.8.14 
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Substituting this into 6.8.6, and using the original fitness scale, 


n-i n-i 
Om = 5, y lijčičj, 6.8.15 
i=1 j=1 
where 
[jj = Wig — Wig — Wyn TOW (wij = Wji)- 6.8.16 


The quadratic form on the right side of 6.8.15 is negative definite if and 
only if 


t t hi 2 tha 
11 1230, isc ds dj, 6.8.17 
fz f22 





t4, <0, 
[31 [32 [33 


and so on up to order n — I. 

The second condition is that all gene frequencies be positive. This can be 
done as follows. Let T = [t;;] be the matrix of the quadratic form. It turns 
out that the determinant |T| is equal to the denominator in the right term 
of 6.8.12. So 


IT| 2 34. 
i=1 
However, 6.8.17 requires that 
(—1)"""|T| > 0, 
SO 
(CI) !A;20 (i=1,2,...,n) 6.8.18 


in order that all the gene frequencies are positive. 

To summarize: The conditions for a stable equilibrium are 6.8.17 and 
6.8.18. 

It may be convenient mnemonically to remember that t;; may be written 
symbolically as (i — n)(j — n), which gives ij — in — jn 4- nn. 

Exactly the same arguments apply to the continuous model. The necessary 
and sufficient conditions are the same, except that the w,,’s are replaced by 
Malthusian parameters. 

In fact the first report, by Kimura (1956a), was for a continuous model. 
Mandel's (1959) formulae for the discrete case are equivalent. 

Those reading Mandel’s papers can verify that, for example, his condition 


aq, d, | 
ant ann l 
1 1 6 





-0 
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is the same as t,, < 0, since the expanded determinant is 
—(a — Ayn — Any + Ann) = — fii. 
In the case of two alleles the conditions become 


(1) tii = Wi, — 2wy2 + W22 >Q, 





] w 
(2) -a= -|i oe = W12 — W22 > 0, 
22 6.8.19 
w 1 
—-A,=- ae iwaw. 





These are equivalent to the condition w,, < w,; > w22 that we discussed 
earlier. 
For three alleles 


Wit Wi2  W13 
A=|W21 W22 Wasi, 
W31 W32 W33 


| wi; Wis wı l wj Wi; Wi2 ] 
A, =|! w232 wai A,=|w2, l1 wj A3 —|w;; wa; l| 
l w32 W33 Wii l was W31 W3 l 


thy = (1 — 3)? = wy — Wiz — W31 + W33, 
t2 — (1 — 3X2 — 3) = wį2 — w13 — W32 + W33 =a, 


SS 
t22 = (2 — 3)? = w22 — W23 — W32 + W33. 








(1) 4,<0, o t>o, 
t21 122 
(2) A >0, A, » 0, A, >Q. 
But, since 
hi h| 
=A, + å, + A3, 
tor 122 








in the 3-allele case the necessary and sufficient conditions are: 
A, » 0, A, >0, A; > Q, tı <0. 6.8.20 


With multiple alleles it is not necessary that each heterozygote be fitter 
than any homozygote; there can be a stable equilibrium without this. On 
the other hand, it is not sufficient for all heterozygotes to be superior; some 
such systems are unstable. However, the population fitness, w, at equilibrium 
must exceed the fitness of any homozygote (Mandel, 1959). For a general 
review and extensions to inbred and polyploid populations, see Li (1967b). 
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6.9 Some Other Equilibria Maintained 
by Balanced Selective Forces 


The number of possibilities for stable and unstable equilibria is enormous, 
even when only a single locus is considered, as we have just seen. When other 
complications are added, such as multiple loci, linkage, and variable selection 
coefficients, the possibilities become enormous. We shall mention only a few. 


Sex-linked Polymorphism Consider a single locus with two alleles and 
with random mating. Let pf and pł be the frequencies of A, and A, in the 
male gametic output and, similarly, pf*and p$* be the corresponding fre- 
quencies in the female gametic output. 

If we denote the fitnesses of A, and A, males by w, and w, , and those of 
A,A,, 4,42, and A; A; females by w,,, #12, w25, then the frequencies of these 


MALE FEMALE 

GENOTYPE A; A2 A,4, A142 A242 
** * x * * 

FREQUENCY  pi*  pi* Ptpt* pipi -pr'pi pipi 

FITNESS Wi W2 Wii “Wi W22 


Then the frequencies of A, in the male and female gametic outputs for the 
next generation are 
x; wi pi” 
Pi S—— 3 
w 


6.9.1 
pi" wi Pipi“ + łwia(pi p3" + Pt pi) 
re S LENIN du MUN LR: 


w** NE 
where w* and w** are respectively the mean fitness in male and females, 
and pł = 1 — pt, pi* = 1 — př*. 
In the following calculation, we will consider gene ratios rather than gene 
frequencies. Let 


and 
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be respectively the gene ratios in male and female gametic outputs; then 
6.9.] is equivalent to 


x =i 
W2 


Wi2 
WiiXy + =U + y) 


6.9.2 


w 
(x+y) + w22 


At equilibrium in which x' = x, and y' = y, letting 
w 
— =k, 


W2 
we obtain from 6.9.2. 
wiXky + y) +222 © 


By solving this for y and noting that x = ky, we obtain the gene ratios at 
equilibrium ; 
p= (k + 1)wi2 — 2w2 


(k + wi E 2kwi, : 6.9.3 
R=ky. 
Since ? is positive, it is necessary that either 


(k + 1)wi2 — 222. > 0, (k + 1)w,5 — 2kw,, > 0, 
or 6.9.4 
(k + l)wy2 — 2w22 < 0, (k + 1)wv,4 —2kw,, <0. 


Let us now investigate the conditions for stability of the equilibrium. 
From 6.9.2, considering the change in y over two generations, we get 


"o 2w ,kyy' + wi2(ky + y) 


- 6.9.5 
Wi 2(ky + y) + 2w;; 


First, consider the situation in which the frequency of A, is very low in 
the population so that y is very near to 0. Neglecting higher order terms in 
6.9.5, we get 

y" = 223 (ky + y) 

2w23 
Or 


2w5;y" — W12y' — w;5 ky = 0. 6.9.6 
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For the equilibrium to be stable, it is necessary that y tends to increase. 
Using the procedures of Section 3.8, the characteristic equation of 6.9.6 is 


2w32 A? = Wi2d em W12k = 0. 


It has two roots, 


a ie + ./wi2 + 8w12 W22 k 


4w54 


El 


one positive and the other negative and the former has the larger absolute 
value. For y to increase, it is necessary that this root be larger than unity. 
The condition for this is satisfied if 


W12(1 + k) — 2w;, > 0. 6.9.7 


(Note that if w,, > 4w;; , the larger root is larger than unity but this condition 
is weaker than 6.9.7.) 

Next, consider the situation in which the frequency of A, is very high, 
or the frequency of its allele 4, is very low. In this situation it is necessary 
that A, tend to increase for the equilibrium to be stable. From the considera- 
tion of symmetry, it is easily seen that A, tends to increase if 


wil + k^!) — 2w > 0. 6.9.8 


Conditions 6.9.7 and 6.9.8 satisfy one of the set of conditions in 6.9.4. 
Thus the conditions for the equilibrium to be stable are 


2 
W12 > | 77 J 22> 


l+k 
6.9.9 
NE ) 
Ww dum , 
12 1 Lk! 11 
or alternatively, noting that k = w,/w,, 

W12 2w, 

Wir WW 
6.9.10 


Wi 2w5 
ae 


W2 Wy + W3 


(Kimura, 1960a). This agrees with the set of conditions first given by Bennett 
(1957). For a different derivation of the equivalent conditions and for the 
effect of inbreeding on the equilibrium, see Haldane and Jayakar (1964). 
Note here that the arguments leading to 6.9.7. and 6.9.8 are valid only when 
there is only one equilibrium point. 
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As an example of the application of formula 6.9.10, let us consider a 
mutant gene A, which is advantageous in males (w; > w,) but deleterious in 
females (w223 < w,,). If A; is recessive in females (w,2 = w;,), a stable poly- 
morphism is possible provided that 

2w2 Wit 


w TW; W2 
is satisfied. On the other hand, if A, is dominant in females (w,; = w22), 


the second inequality in 6.9.10 is violated and no stable polymorphism is 
possible. 


Polymorphism Due to Selection of Varying Direction In this sec- 
tion, we will consider a situation in which selection fluctuates from generation 
to generation such that it sometimes favors one phenotype and sometimes 
another. 

Let A, and A, be a pair of alleles at an autosomal locus and assume that 
A; is fully recessive with respect to fitness. Let w, be the selective value of the 
recessive (45 A2) relative to the dominant (4,4,, 4,45), and, let p, and 
q,(= | — p,) be respectively the frequencies of A, and A, inthe tth generation. 
Under random mating, the frequency of A, in the next generation is 


|t B4 Pt 
Died = == = 


LI 6.9.11 
Wy W, 


where 
W, = P? + 2b. d, + 9,q;. 


First, consider the case in which 4, is very rare. In this case, the ratio 
Poalp, ìs Vw, at the limit of p, + 0. This means that the proportional increase 
or decrease per generation of the frequency of the dominant allele (4,) will 
be large and depends almost entirely on 1/w,. Thus 


Diei Pea x Pr Pi 1 x 1 1 
~ = i mee ae 6.9.12 
Po P: Pr-1 Po W Wi Wo 


In order that the polymorphism be maintained, it is necessary that the 
frequency of A, tend to increase when it is rare. Namely, 

E | 

Il —>1, 


i=0 Wi 


or 


t 
I[w; <1. 6.9.13 
i=0 
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Next, consider the case in which the frequency of A, is very high, or equiva- 
lently, the frequency of A, is very low. In this case, 


E qp, + 4) 
W, 


Qr«1 6.9.14 
and the ratio q,+1/4, is unity at the limit q, — 0. This means that the propor- 
tional increase or decrease per generation of the frequency of the recessive 
allele (A2) is extremely small when it is rare, and the consideration of the ratio 
is not very useful. Also the difference q,., — q, is of the order of q? and there- 
fore tends to 0 very quickly as q, > 0. On the other hand, the reciprocal of 
q, is very large and the difference of this quantity between two successive 
generations turns out to be finite and therefore more useful. That is to say 
we magnify the difference which actually is very small. Let Q, — 1/q, ; then 


e (1 — wQ., — 1) 


041-A= "UE pu t 6.9.15 
and, at the limit of Q, — oo, this has the limit | — w,. 
Therefore 
t 
0.41 — Qo La — wj). 6.9.16 


In order that the polymorphism be maintained, A, must increase in this case, 
or the reciprocal of its frequency must decrease, so that 


SU wei, 
zo 


or 


mM- 


l 
— y wi» l. 6.9.17 
t+1,; 


0 


This treatment shows that the sufficient conditions for polymorphism are 
6.9.13 and 6.9.17. Namely, the arithmetic mean of the fitness of the recessive 
is larger than unity while its geometric mean is less than unity. This elegant 
theorem was first proved by Haldane and Jayakar (1963). 

These conditions would be fulfilled, for example, if the recessives were 
5%-10% fitter than the dominants but an epidemic disease killed off all the 
recessives every twenty generations. The geometric mean is less than 1 while 
the arithmetic is larger. After each epidemic the frequency of recessive genes 
may decrease suddenly, followed by steady increase until the next epidemic. 


Multiple-niche Polymorphism We shall consider briefly a simple model 
of Levene (1953). Suppose there are two alleles A, and A, and mating is at 
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random. The zygote frequencies are p°, 2pq, and q?. These genotypes are 
then distributed randomly into separate niches, each of which is different 
as regards the selective values of the three genotypes. In the ith niche the 
relative fitnesses are: 


A.A, A,A2 A2rA2 
Wi, = 1-5; Wi2-l w22 =l1— 4 


We make one more assumption—that the fraction that matures in the 
ith niche is k; (Y, k; = 1). After maturation the progeny from all niches are 
randomized before mating. 

In this model the average fitness in the ith niche is 
=p? (1 — s) + 2pq t a*(1 — t) 


6.9.18 
=1-—p’s,;-q't;, 
and the average fitness of allele A, in the ith niche is 
wi; =pl — s) +q =1 -— psi. 6.9.19 
From 5.2.6 the gene frequency change in the ith niche is 
Am pw, — Wi) — pa(at; — ps) 
Wi W; 
Averaging over all niches, 
S; 
Ap = pq 2 k, 2 DIT. 6.9.20 
Wi 


There will be at least one stable intermediate equilibrium if p increases 
when it is small and decreases when it is large. So we shall use this to determine 
sufficient conditions for a polymorphism. If d(Ap)/dp > 0 when p — 0 and 
d(Ap)dp < 0 when p — 1 the conditions will be satisfied, since Ap = 0 at p = 0 
or |. 








From 6.9.20, 
UE 
Ix 
= Lk; Wird : 


since ) k; = 1. 
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So one condition for a stable equilibrium is 


1 
È ki 


W22, i 


(x k; =) <1. 6.9.21 


Similarly, the second condition based on the limit as p — | is 


|J (x k; : | « 1. 6.9.22 
Wit 


In words: If the weighted harmonic means of the fitnesses in the various 
niches show overdominance for fitness, there is a stable polymorphism. 

Since the harmonic mean is generally smaller than the arithmetic, it is 
possible for there to be a stable polymorphism with the arithmetic means not 
showing heterozygote superiority in fitness. So, with multiple niches the 
conditions for a stable polymorphism are less stringent than with constant 
fitnesses. 

Prout (1968) has shown that the necessary conditions are actually 
slightly weaker than we have indicated. We can actually replace < by < in 
equations 6.9.2] and 6.9.22. See Prout's paper for a discussion. This model has 
also been discussed by Dempster (1955a). 

The Levene model that we have discussed was chosen deliberately to 
demonstrate the opportunity for polymorphism under a situation rather 
unfavorable to polymorphism. If the organisms are able to select their niches, 
then the opportunity for polymorphism is obviously greater. Another situa- 
tion is when the opportunity for migration between niches is restricted so 
that there is more opportunity for maintenance of polymorphism by local 
differentiation. We shall not discuss these mathematically; in fact this subject 
has received very little attention. For a discussion of what has been done, 
see Moran (1959) and Levene (1967). 





> 1, 


or 








Two-locus Polymorphism When multiple alleles at multiple loci are 
considered, the problem quickly becomes very complicated. There are numer- 
ous possibilities for stable and unstable equilibria, particularly if the fitnesses 
of the different genotypes are not constant. We shall deal with only the simplest 
case, however. Assume that the population is mating at random and consider 
two loci with two segregating alleles at each. 

With a single locus and random mating we saw that the fitness never 
decreases. If this is also true for two loci, then we can apply the kinds of 
stability criteria that were used in Section 6.8. If the linkage is loose so that 
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the gametic types are in linkage equilibrium, a minor extension of these 
methods is directly applicable. It has been given by Kimura (1956a). 

But it is not generally true that selection always leads to an increase in 
fitness. However, under one circumstance it does. We showed in Section 5.7 
that the Fundamental Theorem of Natural Selection applies if the population 
is in quasi-linkage equilibrium. In Section 5.9 the same condition was shown 
to be sufficient for the Maximum Principle to work. So we should expect 
that under this condition there would be a stable equilibrium at the point 
where fitness is maximized. This is shown by doing almost exactly the same 
operation as we did in Section 5.9. 

Let p,, P2, p3, and p, be the frequencies of the four chromosomes 
AB, Ab, aB, and ab and c the proportion of recombination between the two 
loci. As before, we regard unlinked loci as a special case where c = 1/2. 
The equations of gene frequency change (5.4.4, 5.9.25) are 


dpi 
dt 
where 


D = pipa — P2 P3 
m; =}, myP;, m =}, mi Pi Pj 
J J 


= p;(m; — m) — k;cbD, 6.9.23 


1 when i= 1, 4 
— 1 when i = 2, 3 


k= 


and b is the birth rate of double heterozygotes. 
At equilibrium, when dp,/dt = 0. 


p,(m, — m) = —py(m; — m) = —py(m, — m) = p4(m, — m) = cbD. 
6.9.24 
The solution of these equations gives the equilibrium values. 
Now we wish to show that the equilibrium is where fitness is maximum. 
We impose two side conditions, 


pl 6.9.25 
and the quasi-linkage equilibrium relation 


= Pi P4 
P2P3 


or, more conveniently, 


Z = constant, 


log Z = log p, — log p; — log p, + log p, = K, 6.9.26 


a constant. 
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Using A and B as Lagrange multipliers, as we did in Section 5.9, we 
maximize 


y =m —2A4() p; — 1) — 2B(log Z — K) = 0. 6.9.27 
Differentiating, 
ow 2B 
Bp ee mug) 04 T 
2B 
= 2m, 2A— — —0, 
Di 
ð 2B 
LEY NO, 
ap, P2 
ð 2B 
A E NEUEN 
ôP, P3 
2B 
— = 2m, — 2A —- — =Q. 6.9.28 
Op, P4 


Multiplying the first equation by p,, the second by p,, the third by p,, and 
the fourth by p, leads to 


p,(my — A) = —py(m; — A) = —py(m, — A) 


c pala p A) =B. 6.9.29 


To evaluate A and B, add the first and fourth equations of 6.9.28 and 
subtract the second and third. This gives 


l 
A m d 


or in the symbols of 5.4.8, 


E 


B=-., 9. 
P 6.9.30 


Now, again multiply the four equations 6.9.28 by p,, P2, P3, and p, and add. 
This gives 


Y pm — A Yi = 0, 
or 


Á — m. 6.9.31 
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Substituting into 6.9.29 and noting that at quasi-equilibrium 


bD= E 

ida. 

(see 5.7.13), we see that equations 6.9.29 are equivalent to 6.9.24. 

So we have shown that the point where all the dp;/dt's are 0 (6.9.24) and 
which is therefore an equilibrium is also the point at which fitness is maximum 
(or minimum) under quasi-linkage equilibrium. Once again we have an illus- 
tration that a population behaves simply (or at least relatively simply) when 
it is in quasi-equilibrium. Furthermore, as we showed in Sections 5.4 and 5.7, 
this is the state toward which the population tends to move—unless recombi- 
nation is small relative to the epistatic parameters E. When the population is in 
quasi-linkage equilibrium any fitness change is attributable to changes in gene 
frequencies, not to changes in linkage relationships. 

Selection proceeds to maximize fitness when there is quasi-linkage 
equilibrium, that is, with relatively free recombination and small amounts 
of epistasis. At the opposite extreme, with no recombination (or negligible 
amounts) the chromosomes behave as units and the situation is as if there 
were multiple alleles at a single locus. This, too, we have seen leads to a 
maximization of fitness by selection. So the stability criteria based on this 
assumption are valid. 

The intermediate cases are more troublesome. When E is large enough 
relative to c that quasi-linkage equilibrium is not approached, then more 
complicated stability conditions are needed. One such procedure has been 
given by Kimura (19566). 

Ordinarily, epistasis is not sufficient to maintain a polymorphism: 
some overdominance is required. However, if one locus is overdominant, 
a second locus that is not overdominant can also be maintained in stable 
polymorphism provided there is the right kind of epistasis. and tight enough 
linkage. A specific model, proposed by P. M. Sheppard, is this: Assume one 
locus with a pair of alleles, say A, and 43, kept in balanced polymorphism 
by heterozygote superiority in fitness. Another pair of alleles, B, and B3, 
are at another locus on the same chromosome, and interact with the genes at 
the first locus in such a way that A, is advantageous in combination with B, 
but is disadvantageous in combination with B, , while the situation is reversed 
with 45. Then the second locus will remain polymorphic if linkage between 
the two loci is sufficiently close. A special case was investigated by Kimura 
(1956b). 

An interesting aspect of this situation is that it offers a possible explana- 
tion for modification of linkage intensity. In The Genetical Theory of Natural 
Selection, Fisher (1930, 1958) noted that “the presence of pairs of factors in 
the same chromosome, the selective advantage of each of which reverses 
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that of the other, will always tend to diminish recombination, and therefore 
to increase the intensity of linkage in the chromosomes of that species." 
With a situation such as that of the paragraph above, where there is a stable 
polymorphism, recombination on the average increases the frequency of the 
less well-adapted chromosomes so any mechanism which reduces the amount 
of recombination will cet. par. have a selective advantage. Whether this is 
a major factor in increasing the intensity of linkage in actual evolution is not 
known. For further discussion. see Nei (1968). 


6.10 Selection and the Sex Ratio 


It is not immediately apparent how selection operates on the sex ratio, for a 
parent that gives rise to offspring with an unequal sex ratio still produces just 
as many offspring and transmits just as many of its autosomes to the next 
generation as if the sexes were produced in equal proportions. The X X- XY 
sex-determining mechanism assures approximate equality of the sexes in 
many species, but there are many examples of autosomal genes that produce 
minor deviations in the sex ratio. Thus, there is genetic variation on which 
selection could operate. 

It is interesting that Darwin was unable to find a solution to the problem. 
In The Descent of Man he writes: 


In no case, as far as we can see, would an inherited tendency to produce sexes 
in equal numbers or to produce one sex in excess, be a direct advantage or 
disadvantage to certain individuals more than to others; for instance, an 
individual with a tendency to produce more males than females would not 
succeed better in the battle for life than an individual with an opposite tendency; 
and therefore a tendency of this kind could not be gained through natural 
selection.... I formerly thought that when a tendency to produce the sexes 
in equal numbers was advantageous to the species, it would follow from natural 
selection, but I now see that the whole problem is so intricate that it is safer 
to leave its solution to the future. 


It is possible that if Darwin had considered three generations instead 
of only two he might have thought of a solution. For one can reason this way: 
Consider a population that for some reason has an excess of males. In such 
a population a female, on the average, has a larger number of offspring than 
a male. This follows from the simple fact that each individual progeny is 
the product of one egg and one sperm. Therefore the total contribution to 
the next generation is the same for males and females, and if there are fewer 
females the average female produces more progeny than the average male. 
Now, in such a population an individual that produces more than the average 
proportion of females will be producing more of that sex, which in turn 
contributes more to the next generation. Thus this individual will have more 
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grandprogeny than the average of its contemporaries. To the extent that the 
tendency to produce progeny with an altered sex ratio is hereditary, the tend- 
ency will be transmitted and the average of the population will change in 
the direction of equality of the sexes. Whenever the sex ratio becomes equalized 
the selective advantage of the altered type will cease. The same argument 
holds in the opposite direction if the population has an initialexcess of females. 
So, by the simple expedient of looking at one more generation than is custom- 
ary, we arrive at an understanding of the problem. 

This consideration helps us understand why the sex ratio is typically 
somewhere near l: 1. This would be expected to be true even of highly 
polygamous species, where the optimum ratio for perpetuation of the species 
might be thought to be far from equal. But this verbal analysis leaves several 
points vague. What happens if, as is almost always true, there is differential 
mortality in the two sexes? At what ageisthe sex ratio equalized by selection— 
at the zygote stage, at the reproductive age, or some other? For answers to 
these points we shall have to inquire in a more rigorous way. 

This problem was first solved by R. A. Fisher (1930). His treatment 
involved the concept of ‘‘ parental expenditure," the meaning of which is not 
entirely transparent (to us, at least). The question has been discussed further 
from similar points of view by Bodmer and Edwards (1960), Kolman (1960), 
MacArthur (1965), and Emlen (1968a, b). However, we shall treat the problem 
in a more mechanistic way. This is rather similar to the methods used by 
Shaw and Mohler (1953) and Shaw (1958). 

Suppose that there is a population of adults (generation 0) producing a 
total of N progeny, of which Np are male and Nq are female. Among the 
population is a special group, say A, who produce a fraction p of males and 
q of females. Suppose that this group contributes a total of n offspring. Let 
Sm and s, be the probability of survival to adulthood for males and females 
and let fa and f, be the fertility of the two sexes (the /’s will not be constants, 
for their ratio will be related to the proportion of the two sexes in the popula- 
tion). Then in the progeny (generation 1), we will have: 


PROGENY OF A's ENTIRE POPULATION 
GENERATION STAGE MALES FEMALES MALES FEMALES 
l zygote np nq Np Ng 
adult fipSm nqs, NDSm Nås; 
gametes 2NPSmf m 2nqs, f, 2Npsufm 2Nqs;f, 
TOTAL TOTAL 
2 zygote n( psu f m + qs, fy) N(Bsw f m + qsr fy) 
gen. 2 zygotes _ " 
ORE REUS PSmfm + QS, fr DS wf m + ds p fr 


^ gen. | zygotes 
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The average excess of W for type A over the population average, expressed 
as a fraction of the population average, is 


(WaW) _ PSmSm + d5p Ss — Psm f — 455 Sr "— 
: W PSm fmm + qs, fr 


But, recall that the total contribution of males must equal that of females. 
So 


PSm Jm = 957 Sro 6.10.2 


and with this substitution we obtain 


p q 
P EE c 
i e 2d 6.10.3 
» (P — pXa — p) 
2pq 


If the sex-ratio property of type A is completely heritable, then the rate 
of change per generation of this type is 


W,- W 
AP, = p, T = BOE. Pees 6.10.4 


where P, is the proportion of A in the population. With less than complete 
heritability the rate is correspondingly less. However, the change is always 
delayed by one generation, since it is the excess of grandprogeny over progeny 
that we are measuring. 

There are two ways for E, to be 0. One is the trivial case where p = p. 
This simply means that A produces the same sex ratio as the average of the 
population and therefore has no tendency to increase or decrease. More 
interesting is the case where p = d. This says that when male and female 
zygotes are produced in equal numbers, there is no differential contribution of 
a genotype with a different sex ratio. 

That the | : | sex ratio is stable can be seen by noting that when 4 > p, 
E, is positive if p > p; that is to say, if there is an excess of females in the 
population, a genotype that produces more than the average proportion of 
males leaves more descendants than the average. Therefore, if the trait is 
heritable the proportion cf males in the population increases. This will 
continue until p = d, in which case the A genotype no longer enjoys a selective 
advantage. 

Likewise, if the population initially has an excess of males, a genotype 
producing a more than average proportion of females will leave more 
descendants. So, in either case the population tends to move in the direction 
of equal numbers of the two sexes. 
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The equality of the two sexes is at the zygote stage. Notice that the 
factors for survival cancel out and do not appear in the final equation, so 
differential mortality does not affect the 1 : 1 zygotic sex ratio. The number 
at later stages is determined by the relative mortality of the two sexes. At 
adulthood the sex ratio would be ps, : js; . 

It may seem as if differential mortality should have some effect on the 
zygotic sex ratio, but clearly this is not so for the model that we have been 
discussing. However, there is at least one way in which mortality can have 
such an effect. This is applicable to uniparous mammals and especially to 
man. If an embryo or child dies it may be replaced, or there may be some 
compensation in that the next zygote will be conceived earlier than if the first 
had survived, thus making possible a larger number of conceptions. In man, 
males have a higher death rate than females, both prenatally and postnatally. 
Thus if there is compensation, a woman producing an excess of males will 
produce a larger than average number of zygotes, although a smaller number 
of surviving children (unless the compensation is complete). 

As a simple model, assume that compensation is complete so that, despite 
differential mortality prior to a certain age, the number of children that 
reach this age (say 7) is independent of the sex ratio in the sibship. Let S,, 
and S, be the fraction of survivors in each sex from conception until age T. 
Let Sm and s, now stand for the probability of surviving from age T to adult- 
hood. Then, starting with generation 0 as before, the table is modified as 
follows: 


PROGENY OF A’S ENTIRE POPULATION 
GENERATION STAGE MALES FEMALES MALES FEMALES 
l zygote np nq ND Nå 
T NDS m nqS, NDSm NgS, 
age “ec Lane. aS ne. “Sn 7 Le sige oe 
5 PS» 4 4S, PSm+ qS; pSud QS, PSn+ QS; 
=nx =ny =Nx = Ny 
adult NXSm nysr NXSm Nys, 
gametes 2NXSmfm 2nys,f;,  2NŽSmfm 2Nysyfy 
TOTAL TOTAL 
2 zygote n(xSmfm + YSsfr) N(Xsms, + yspfy) 


Note that, since y = | — x, the total number of progeny at age T is the 
same as the number of zygotes. This leads to the same equations as before 
(6.10.1—6.10.4), except that we replace p, q, p, and q with x, y, x, and y. 
Since X and y are the proportions of males and females at age T, it follows that 
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with complete compensation the sex ratio is adjusted to equality at that age 
when compensation ceases. 

Compensation is almost certain to be less than complete (if for no other 
reason than that the replacements occur when the parents are older, thereby 
increasing the interval between generations). In this case the sex ratio would 
be adjusted to be equal at some age prior to the age when replacement no 
longer occurs. In the human population with excess mortality of males and 
partial compensation we would expect an excess of males at the zygote stage 
with a steadily decreasing proportion, fewer males than females at the end of 
the age of compensation, and with an intermediate age where the sex ratio 
is unity. There is no implication that compensation is conscious. 

As we mentioned earlier, Fisher (1930) gives a different argument, but 
he reaches the same conclusion: 


The condition toward which Natural Selection will tend will be one in which 
boys are more numerous at birth, but become less numerous, owing to their 
higher death rate, before the end of the period of parental expenditure. The 
actual sex-ratio in man seems to fulfill these conditions somewhat closely, 
especially if we make allowance for the large recent diminution in the deaths 
of infants and children; and since this adjustment is brought about by a 
somewhat large inequality in the sex-ratio at conception, for which no a 
priori reason can be given, it is difficult to avoid the conclusion that the sex- 
ratio has really been adjusted by these means. 


In this model the population sex ratio tends to equality, either at the 
zygote stage, or at a later stage if there is compensation for lost progeny. 
At the same time there is nothing in the model that says that each individual 
in the population will produce progeny with a 1 : 1 sex ratio. It only says that 
the frequencies of the different genotypes in the population will be adjusted 
so that the average sex ratio of their progeny is unity. A population in which 
each individual produces an equal number of male and female progeny 
would have one obvious advantage over a population composed of equal 
numbers of male-producing and female-producing types; it would be less 
dependent on maintaining the proper ratio of the two types and would there- 
fore be better buffered against chance fluctuations in their numbers. On the 
other hand, this selection, depending on selection between groups rather than 
of individuals within groups, is slower and less effective. Therefore, we should 
not be surprised if there is considerable genetic variability among individuals 
in the sex ratio of their progeny in a population that maintains an average 
ratio of l : 1. 

There are many reasons why one would not expect perfect agreement with 
this principle. It may well be that genes affecting the sex ratio of the progeny 
have direct effects on the individual, in which case the selection on the 
individual effect may be more important and the sex ratio would be modified 


POPULATIONS IN APPROXIMATE EQUILIBRIUM 293 


as a by-product. A second reason is that sex-ratio modifiers may be located 
on the X or Y chromosome. It is especially clear in the Y case that if there 
are two kinds of Y chromosomes, one of which leads to a higher proportion 
of sons than the other, the one with the higher proportion of sons will have 
a selective advantage and, in the absence of other influences, eventually 
replace the competing Y. In this case, the sex ratio of the population would 
simply become the ratio associated with the behavior of this particular Y. 


6.11 Stabilizing Selection 


Several times in this book we have noted that for almost any metrical trait 
(except fitness itself) the most fit individuals are likely to have an intermediate 
value for the trait. If size is the trait, the species typically has a characteristic 
size and individuals that are too large or too small are less viable or fertile, 
and similarly for other quantitative traits. 

This means that selection is directed toward elimination of deviants, 
and the effect should tend to reduce the variance and stabilize the population 
at an intermediate value. Of course the population, especially if it is Men- 
delian, will not become absolutely uniform. There are all sorts of variability- 
generating mechanisms, many of which we have discussed. 

An immediate question is whether selection favoring intermediate values 
tends to perpetuate homozygous or heterozygous combinations. In either 
case the average will be much the same, but if the former is true the variance 
will be less. 

We can get a rough answer by considering a very simple model, such 
as was used in Section 4.7. Suppose that alleles 49 and B, decrease size while 
A,B, increase it. Assume further that the genes are without dominance 
and that there is additivity between loci. 

Selection for an intermediate measurement will favor genotypes 
Ao Ag Bı By, A, A, Bo Bo, or A, Aq B, Bo. Which type will prevail? 

Note first that gametes Ay B, and A, Bo will more often enter into inter- 
mediate zygotes than A,B, or Ag Bo gametes, and therefore are favored. 
Then, if one of these (say A, Bo) is more frequent, either 4, By or Ao B, gametes 
can produce intermediate phenotypes with A, By, but the latter will produce 
by recombination the two unfavored types A, B, and Ag Bo. So we would 
expect that 4, B, type of gamete will increase and eventually the population 
will be homozygous 4,4, Bo Bo. 

This intuitive argument that selection favoring an intermediate type 
leads to an intermediate homozygous genotype rather than an intermediate 
heterozygous genotype was originally given, in a more convincing way than 
we have, by Wright (1935). Wright assumed that the genes were additive 
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in producing the metrical trait and that fitness decreased as the square of 
the deviation from the optimum measure. 

This quadratic model has been studied by several workers (e.g., Robert- 
son, 1956; Kojima, 1959a; Jain and Allard, 1965; Lewontin, 1965a; Levene, 
1967). The general conclusion is that with complete or partial dominance 
on the quantitative trait, selection for intermediate values leads to homozy- 
gosity. If the optimum phenotype is necessarily heterozygous at one or more 
loci, for example if the optimum falls between two homozygotes, heterozy- 
gosity for one (or perhaps more) loci may persist. But, if the number of 
genes affecting this trait is large, it is unlikely that more than a tiny fraction 
will remain heterozygous. The larger the number of loci, the smaller the 
fraction that remain segregating. 

In summary, selection for an intermediate phenotype ordinarily leads to 
homozygosity. If we are to explain much variability we must look elsewhere. 

The two most obvious sources of variation are mutation and hetero- 
zygote superiority in fitness. With the quadratic model, small deviations from 
the mean do not cause much change in the fitness. So, a moderate mutation 
rate can maintain a substantial amount of variability of the metrical trait 
because of the slow rate of elimination of mutants (see Wright, 1935, 1952). 

The second possibility is that some loci are favored as heterozygotes for 
reasons other than their effect on the quantitative trait. The ubiquity of 
pleiotropy argues that genes affecting size (say) also have other effects. Some 
such effects may lead to heterozygote superiority. If so, this provides a ready 
mechanism for maintenance of variability (see Robertson, 1956). 

An example—rather a specialized one—of selection for an intermediate 
is provided by selection for the amount of redundant DNA, such as ribosomal 
DNA. There is good evidence that either too small or too large an amount 
is deleterious. New variability can be generated by unequal crossing over, 
for which there is evidence in Drosophila where the bobbed mutant is the 
phenotypic manifestation of a reduced amount of ribosomal DNA. See 
Kimura (1965) for a similar model. 

This example will show one more instance of the generation of a normal 
distribution. We shall assume that fitness is maximum for an intermediate 
length of chromosome and decreases with the square of the deviation from 
this optimum amount. 

Let K and L be the number of repeats in the maternal and paternal 
chromosomes, respectively. Let the optimum fitness, M, be that of an indi- 
vidual with a total of N repeats, counting both homologs. Assume that fitness 
is measured in Malthusian parameters. We then have: 


NUMBER OF REPEATS N K+L 
FITNESS M M—c(K+L—N)? 
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where c is a proportionality constant. Let py be the frequency of a chromo- 
some with K repeats. 
The average fitness of the population is 


m2M-cY(K-*L- NYpkpi- 6.11.1 
K.L 


Likewise, the average fitness of a chromosome with K repeats is 


my- M-c 2 (K +L- Nyp, 6.11.2 


From our familiar equation for frequency change (5.3.5), 


aPx = py(my — m). 6.11.3 
dt 


lt is convenient to rescale and measure the number of repeats as devia- 
tions from the optimum number (which is N/2 per chromosome), so we let 
k = K — N[2 and / = L — N/2. Then 6.11.3 becomes 


d 
"a -ndxe + D'p, - Y + Dp i] 6.11.4 


With random mating, k and / are independent and, since Y' p, = ). p, = 1, 


20 *Dp- (Se EXE DIL 


6.11.5 
=k'+ 12 42ki 
Thus 6.11.4 becomes 
d = = 
SP. a ~ep,| +P- Y (k? + Pp 
: à 6.11.6 


= ~cpy|k? — &? — 2k? + 2kk] 


That is to say, selection is as if it were acting on a haploid. 

Now consider recombination. Let : be the rate at which a chromosome 
with k repeats is converted into something else. Then the change in frequency 
from this is 


d 
T = —up, tu 2: Px-xf (x), 6.11.7 
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where f(x) is the proportion of changes that increase the length by an amount 
x. Let us assume that x is small and expand 6.11.7 into a Taylor series, which 
gives 





dpy — In- , P x^ d'p, a 
qi o CUR RW ese I(x) 
P 6.11.8 
E ,dp, , ux? d?p, 
x ux TY 2 dk?’ 


where X =) xf(x), x? = Y, x?f(x), and Y f(x) = 1. 
Combining 6.11.6 and 6.11.8 and letting dp,/dt — 0 for equilibrium, we 
get 


x2 d2 d = a = 
pt ut PE e(k? — K? - 247 + 2kk)p, = 0. 6.11.9 


The appropriate solution to this equation (you may verify the result by 
differentiating, noting that k? = k? + V) is 


— (k-k)? 
p eer ay 6.11.10 
k^ , .11. 
JJ 25V 





ux 
k = Cru a 
2 J2uxte 


So k is normally distributed with mean k and variance V. 
In the original units 


N 
K=k+>, 


6.11.11 


V(K) = V(k) = 6.11.12 
With random mating the zygotic mean is 2K and the variance is 2V. 

The mean and optimum will coincide if x = 0, that is, if recombination 
is equally likely to increase or decrease the length—an assumption that is 
reasonable if the unequal exchange events are reciprocal. 
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6.12 Average Fitness and Genetic Loads 


The change of average fitness associated with maintaining the variability in 
a population has come to be called the genetic load. The word **load" was 
first used by Muller (1950) in assessing the impact of mutation on the human 
population. In Muller's thinking the load was clearly a burden, measured in 
terms of reduced fitness, but felt in terms of death, sterility, illness, pain, and 
frustration. In most evolutionary and population genetics considerations, 
however, it is used as a measure of the amount of natural selection associated 
with a certain amount of genetic variability. It is not necessarily a burden; 
on the contrary, it may be the sine qua non for further evolution without 
which the population could become extinct. For example, consider a popu- 
lation, previously of genotype AA, which undergoes a mutation to A’ such 
that A’A is superior to AA, but 4' A' is inferior. This creates variability and 
therefore a genetic load, but the polymorphic population has a higher average 
fitness than if it were monomorphic for the original AA genotype. 

The genetic load also applies to traits other than fitness. For example 
the load may be measured as an increased death rate, decreased fertility, or 
increase in some specific disease. The expressed load is the load that is ex- 
pressed under the mating system of the population, usually approximated 
by random-mating theory. The total load includes also the hidden load that 
is concealed by heterozygosis and is brought out by inbreeding. 


6.12.1 Kinds of Genetic Loads and Definitions 


There are many kinds of genetic loads. Any factor that influences gene fre- 
quencies can lead to a change in the average fitness. We give here a list of 
some of the factors that may produce a genetic load, together with a short 
discussion of each. The first two were introduced by Haldane in his original 
paper (1937), from which the genetic load concept came. 


1. Mutation Every population is exposed continuously to newly occurring 
mutations, the average effect of which is to lower the fitness in the environ- 
ment where the population lives. This decrease in fitness, Haldane suggests, 
is the price paid by a species for its capacity to evolve. 


2. Segregation The segregation load, or balanced load (Dobzhansky, 
1955, 1957), occurs when the fittest genotype is heterozygous. In a sexual 
population homozygous progeny are regularly produced by Mendelian segre- 
gation and the segregation load is the decrease in fitness compared to the 
most fit heterozygote—the type that would prevail if it were not for 
segregation. 
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3. Recombination As pointed out by Fisher (1930), if two chromosomes, 
say AB and ab, are on the average favored by natural selection in the various 
zygote combinations into which they enter, while the two types Ab and aB 
are in disfavor, the average effect of recombination will be to increase the 
inferior types. Thus recombination produces a decrease in fitness. 


4. Heterogeneous Environment If there are different environments and 
the various genotypes respond to these in different ways, it is advantageous 
if those genotypes that fare best in an unusual environment are rare while 
those which prefer an abundant environment are common. This means that 
there will be an optimum allotment of genotype frequencies corresponding 
to the numbers of available spaces in the various environments. Any actual 
population will depart from this ideal, and to the extent that this is true there 
is a reduced fitness, and therefore a load. Haldane (personal communication) 
suggested that such a load might be called the ** dysmetric" load. 


5. Meiotic Drive or Gamete Selection A gene that is favored by 
meiotic drive or gamete selection, or a gene that is closely linked to such a 
locus, may persist in the population despite being harmful in various zygotic 
combinations. The population fitness will therefore be lowered and this 
again creates a genetic load. 


6. Maternal-fetal Incompatibility A genotype may have a lowered 
fitness in association with certain maternal genotypes. A well-known example 
is an Rh+ child with an Rh — mother. The genetic load from this cause has 
been called the incompatibility load. 


7. Finite Population In any finite population the gene frequencies drift 
away from the values that they would have if they had reached equilibrium 
in an infinite population. The equilibrium proportions are usually the values 
that maximize the population fitness. The drift away from the equilibrium 
frequencies will usually reduce the fitness, thereby creating a genetic load. 


8. Migration Immigrants to a new area ordinarily carry genes that were 
selected in their original habitat and are therefore not always optimum in 
the new area. The extent to which the average fitness of the population is 
reduced by the continued introduction of less favorable immigrant genes is 
the migration load. 


Definitions In general we define the genetic load as the fraction by which 
the population mean is changed as a consequence of the factor under con- 
sideration in comparison with a population, supposed to be identical other- 
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wise, in which the factor is missing. Usually the measurement of interest is 
fitness, but the concept can be used for other measurements. For the mutation 
load the reference population is a population with no mutation. For the 
segregation load the reference is a population at equilibrium without segre- 
gation, that is, an asexual population. In such a population the fittest geno- 
type attains a frequency 1, so this is the reference. 

The operational definition of the load will then be 


L = ==, 6.12.1.1 


where Wmax 1S the fitness of the reference population, usually taken as the one 
with maximum fitness. 

We shall treat four examples—mutation, segregation, incompatibility, 
and meiotic drive. 


6.12.2 The Mutation Load 


We shall assume that the generations are discrete and use a model based on 
this assumption. Let p be the frequency of the normal allele (or population of 
isoalleles) and q the frequency of the mutant. If the combination of two 
mutants is approximately the average of the two mutant homozygotes, as is 
often the case, q can stand for the sum of the frequencies of all such mutants. 
We use the inbreeding coefficient, f, as a measure of departure from random- 
mating proportions. 


GENOTYPE ALA or AA A,A2 or Aa A2 4; Or aa 
FREQUENCY pi — p*(1 — f) 9 pf pii—2pq(l—f) p=- f) af 
FITNESS Wi. = Wmas Wi2 = Wmal(l — hS) W22 = Wmar(l — 5) 


The expressed genetic load in a population with inbreeding coefficient 
fis 
Wmax — Ww 2 
L= a hs[2pq(1 — £)] + s[q (1 — f£) af ]. 6.12.2.1 


max 


When f = 0 the random-mating mutation load is 
Lo = 2hspq + sq’, 6.12.2.2 
and when f = 1 the inbred mutation load is 


L, = sq. 6.12.2.3 
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From 6.2.11 the equilibrium frequency of q is approximately 


m u _u 
“s(h+f+q) sz 
where z=h+/f +4. 


q 6.12.2.4 


Substituting this into equation 6.12.2.1 gives the expressed load. We 
consider three special cases of interest: 


1. Complete Recessive (/— 0) and Random-mating Proportion 


(f = 0) 
Substituting from 6.12.2.4 into 6.12.2.1 and letting h = f = 0 gives 


" u 
L=sq* =s-=u. 6.12.2.5 
s 


2. No Dominance (/ = 0.5) and Random-mating Proportions 
When / = 1/2 and f = 0, 6.12.2.1 becomes L = qs. Substituting from 6.2.7 
2u 


L= : 6.12.2.6 
| tu 





The striking fact, first noted by Haldane (1937), is that the effect of 
mutation on fitness is related to the mutation rate and not to the amount of 
deleterious effect of the mutant. From these two examples we see that the 
mutation load is equal to the mutation rate for a recessive mutant and to 
about twice this with partial dominance. What about other cases? In general, 
values of h larger than 0.5 give a larger value than 2u/(1 + u) and values less 
than 0.5 give a lesser value. But this statement does not do justice to the true 
picture. The value of L is very insensitive to changes of h over a wide range 
of values, so that unless A is quite close to O the load is very nearly 2u. Table 
6.12.2.1 gives some numerical examples that illustrate this. 


3. Arbitrary Dominance, Random-mating Proportions If hand f are 
both small, but s is large enough to keep q small, then we can write 6.12.2.1 
as approximately 


L z sq(2h 4- q ^ f), 6.12.2.7 


where all terms that involve a square or product of any of the quantities 
h, q, and f are ignored. Comparing this with 6.12.2.4 shows that when A > 0, 
L =u, but when q and f are nearly O so that h is the dominant term, L = 2u. 

The general principle is that if the mutant is completely recessive or if 
there is enough inbreeding that most mutant genes are eliminated as homo- 
zygotes, the load is equal to the mutation rate. On the other hand, if A is 
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Table 6.12.2.1. Values of C where Cu is the mutation load per locus. Diploidy is 
assumed, u is the mutation rate, s the homozygous selective disadvantage of the 
mutant, and As is the disadvantage in heterozygotes. The mutation rate is taken to 
be 107 5. (From Crow, 1970.) 


h 
f 5 0 01 .02 05 l 5 1.0 1.5 
0 .001 1.00 1.10 1.18 1.41 1.66 2.00 2.01 2.01 
.01 1.00 1.27 1.47 1.78 1.93 2.00 2.00 2.00 
1 1.00 1.62 1.83 1.97 1.99 2.00 2.00 2.00 
3 1.00 1.86 1.96 1.99 2.00 2.00 2.00 2.00 
1.0 1.00 1.92 1.98 2.00 2.00 2.00 2.00 2.00 


.05 .001 1.00 1.07 1.14 1.31 1.51 1.90 1.96 1.98 
.01 1.00 1.13 1.24 1.45 1.63 1.90 1.95 1.97 


.l 1.00 1.16 1.27 1.48 1.65 1.90 1.95 1.97 
=) 1.00 1.16 1.27 1.49 1.65 1.90 1.95 1.97 
1.0 1.00 1.16 1.27 1.49 1.65 1.90 1.95 1.97 


appreciable, then most eliminations are through heterozygotes as we have 
discussed earlier in Section 6.2; in this case the load is twice the mutation rate. 
This is shown in Table 6.12.2.1. 

For an X-linked recessive with equal mutation rates in the two sexes, 
Haldane showed that the mutation load is 1.5u. If the mutation rates in the 
two sexes are not equal the load is approximately 


_ Um + 2Uy 


L 
2 3 


6.12.2.8 


where u, and wu, are the mutation rates in the two sexes. For a general dis- 
cussion of the mutation load for an X-linked locus, see Kimura (1961). 
That partially dominant mutants produce approximately twice the load 
that recessives do is not surprising. It can be seen intuitively by using the 
genetic extinction or “‘ genetic death" argument of Muller (1950). A genetic 
death is a preadult death or failure to reproduce, or the statistical equivalent 
in reduced fertility. In a diploid population of constant size N (therefore 
having 2N genes), a genetic extinction caused by death or failure to reproduce 
eliminates one mutant among the 2N genes at this locus. If the mutant is 
recessive, two mutants are eliminated at once so there are two eliminations 
among the 2N genes per extinction. The 2 : 1 ratio will be correct as long as 
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the dominant mutant is rare enough so that it is never eliminated as a homo- 
zygote. If the mutant is an X-linked recessive, practically all the elimininations 
are through genetic extinction of males. There are 3N/2 genes in the popu- 
lation so the extinction eliminates one among the 3N/2. Thus the number of 
genetic extinctions per gene eliminated for dominant, recessive, and X- 
linked recessives is in the ratio 2:1: 1.5. Likewise, if the inbreeding co- 
efficient is high, the eliminations will be through homozygotes and therefore 
the situation is like that of a recessive mutant. 

If the effects of different loci are independent, the load for all loci 
combined is given by 


1-L-[|ü-L)ze *", 6.12.2.9 


where L; is the load at the ith locus. Since the individual L;'s are very small, 
this is very close to L =)’ L;. The mutation load is then, to a very good 
approximation, 


.L2CYu;2CU, 6.12.2.10 


where C is a constant between | and 2 (usually much nearer 2), u; is the total 
mutation to all deleterious mutants at the ith locus, and the summation is 
over all relevant loci. In other words, U = 2: u; is the total mutation rate to 
harmful genes per gamete per generation. 

For a review and discussion of the mutation load with epistasis, see 
Crow (1970). 

If a population, previously mating at random, is inbred without selection, 
the mutation load for f = | from 6.12.2.1 is 


L7 qs, 6.12.2.11 


where q is the frequency before inbreeding began. Substituting from 6.12.2.4 
and assuming that before inbreeding f was 0, 


L,--. 6.12.2112 
For all loci 
u 1 
L,-)---)»u, 6.12.2.13 
mx z Ž 2. 
where Z is the harmonic mean of z, weighted by the mutation rate, that is, 
1 
X-u 
Bt. 
Y. 





NL | 


POPULATIONS IN APPROXIMATE EQUILIBRIUM 303 


If ^ = Q, the ratio of the inbred load (f= 1) to the random-mating load 
(f = 0) is 


L, sq E 
cu ERES Mem Ve 6.12.2.14 
Lo sq? u 


The more realistic case probably includes partial dominance so that A is 
appreciable. If h is the dominant term in z, 


NE 6.12.2.15 
d m 12.2.1 
Lo 2h 

for a single locus, or if many loci are considered, 
Se 6.12.2.16 
p usce 12.2.1 
Lo 2h 


where Å is the harmonic mean of the individual /t values. The general con- 
clusion is that if ^ is small the mutation load is greatly increased by inbreeding. 
We are referring to inbreeding of a population that has come to equilibrium 
under random mating and is then inbred without selection. 


6.12.3 The Segregation Load 


The decreased fitness, in comparison with the heterozygote in situations where 
the heterozygote is favored, arises from the fact that in a Mendelian popu- 
lation there will always be inferior homozygotes arising by segregation. 

To consider the question quantitatively we consider first a simple, 2- 
allele model. 


GENOTYPE A141 A,A: Aa Aa 
FITNESS Wi1 = w(1— s) Wiz? =W w22 = w(1— t) 
FREQUENCY Pi: =p? 2P12 = 2pq P2.=q? 


Assume that s and t are large enough, relative to the mutation rates, that 
mutation can be ignored. Following the procedure used above for the muta- 
tion load, we write the frequency of allele A one generation later in terms 
of the present genotypic frequencies and the selection coefficients. Thus, 
writing p' for the frequency next generation, 


"3 p^w(l — s) + pqw 
w 


, 
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where w= p?w(1 — s) + 2pqw + q?w(1 — t). Equating p to p' to specify an 
equilibrium and solving leads to 





p=; q= , 6.12.3.1 


st 
w= «(1 = A z 6.12.3.2 
S+t 
and from the definition 6.12.1.1, the segregation load is 
t 
| eon 6.12.3.3 
s+t 


Notice that the formula can also be written as 
L = sp? + tq? 6.12.3.4 
= Sp = tq, 6.12.3.5 


where p and q are the equilibrium values of the allele frequencies. 

This example illustrates two principles of the segregation load. The first, 
illustrated by 6.12.3.4, is that the allele with the least selective disadvantage 
makes the largest contribution to the segregation load. For example, if 
s = 0.01 and t = 1, which means that A, Aj is either lethal or sterile, sp? = 
100/101? 20.01 and tq? = 1/101? = 0.0001. So the more drastic allele 
contributes only 1 % of the total segregation load. 

The second principle, first noticed by N. E. Morton, is illustrated by 
equation 6.12.3.5. This is that one can compute the segregation load from 
information on only one allele. This is also true for multiple alleles. 

If there are multiple alleles there are numerous possibilities for stable 
equilibria, as discussed earlier in Section 6.8. In the following discussion, we 
assume that the conditions for a stable equilibrium have been met. We still 
assume random-mating proportions. 

Let the genotypes and fitnesses be: 


GENOTYPE A. A, A. Ay 
FITNESS wu = Wl — Su) wi = w(1 — sy) 6.12.3.6 
FREQUENCY pi 2pip, 
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From the general equation for gene frequency change, 


Ap c Pn) 


Ww 


6.12.3.7 


At equilibrium Ap; = 0, and therefore w; = W, as was shown in Section 6.8. 
So we are led to the very reasonable conclusion that, at equilibrium, each 
allele has the same average fitness as any other, or as the average of all. If 
this were not true, then the allele with a higher average fitness would tend to 
increase and there would be no equilibrium. 
The segregation load, assuming that for some heterozygote 5;; = 0, is 
w-w W—Wi 


L= = —— =S 6.12.3.8 
w w 





where 
w; = w(1 — s;). 
This can also be written as 


L= » DjSij;- 6.12.3.9 
J 


A more useful form is 
L> pisi. 6.12.3.10 


This follows from 6.12.3.9 a fortiori, since the right side of 6.12.3.9 contains 
several nonnegative terms, of which pisi; is only one. 

If all heterozygotes have equal fitnesses and all homozygotes are inferior, 
but not necessarily equal to each other, then the inequality becomes an 
equality. This is because with equal heterozygotes s;; = 0 for all combinations 
where i # j and therefore p; sj; is the only nonzero term in 6.12.3.9. Writing 
6.12.3.10 as 


mp. 


and noting that )' p; = 1, we obtain 
l 
L) —= l, 
Lx 
and 


6.12.3.11 


t 
ll 
3I lo 
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where $ is the harmonic mean of the homozygous disadvantages, sj, 
and n is the number of alleles. This shows that for comparable fitness 
coefficients, the segregation load is inversely proportional to the number of 
alleles maintained in the population. 

Equation 6.12.3.]0 extends the principle previously mentioned to mul- 
tiple alleles. Regardless of the individual fitnesses, if the population is at 
equilibrium under selective balance, the segregation load (or at least a mini- 
mum estimate thereof) can be gotten from information on a single allele 
and its homozygous effect on fitness. The minimum estimate of the load is 
the product of the frequency of that allele and its homozygous selective 
disadvantage relative to the best genotype. 

For example, the frequency (q) of the recessive gene for phenylketonuria 
is about 0.01. The fitness of persons homozygous for this gene is very nearly 0 
(or at least it has been until very recently), so s = 1. Thus the minimum segre- 
gation load if this allele is maintained by selective balance is sq or 0.01. The 
population fitness is at least 1 % less than that of the heterozygote, or the 
best heterozygote if there are multiple alleles. In contrast, if this is a fully 
recessive mutant maintained by recurrent mutation, the necessary mutation 
rate is 0.0001. The mutation load is only 0.0001. So the genetic load in this 
case is 100 times as large if the abnormal gene is maintained by heterozy- 
gote advantage as if it were determined by recurrent mutation. 

A group of independent loci will have a collective segregation load that 
is roughly the sum of the individual loads until the number gets large. Sup- 
pose there were 100 loci, independent in inheritance and in their effect on 
fitness, and each with a load as large as that just mentioned. The total load 
would then be 100 x 0.01, or 1. This means that, with independence, the 
average fitness of the population relative to the best heterozygote is 
(1 — 0.01)!°°, or roughly e^! = 0.37. With 400 loci, e^ * = 0.02. 

In general, the average fitness and load are 


w= we >! L=1-—e 2%! 6.12.3.12 


where / is the load for an individual locus. 

The load can quickly get to be very nearly one if the number of poly- 
morphic loci is large. That this creates problems in accounting for large 
numbers of segregating loci in a population has been discussed repeatedly. 

There are several ways in which a large amount of polymorphism can 
be maintained without a large segregation load. 

One possibility is that the selective differences are very small. If these 
are much less than the reciprocal of the effective population size, the allele 
frequencies will be largely determined by random drift and mutation as we 
shall discuss in Chapter 8. Of course, with neutral genes there is no load. 
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Whether any large number of neutral or nearly neutral mutants exist is an 
open question, but the evidence for them is increasing (e.g., King and Jukes, 
1969). 

A second possibility is that some polymorphic loci are maintained by 
frequency-dependent selection. Jf each allele is favored when rare, but not 
when common, there is a stable equilibrium when each is present in inter- 
mediate frequencies. The selective differences are minimized at or close to 
the equilibrium point, so that a population at equilibrium can be polymorphic 
with very little load. However, in any real population there will be drift away 
from the equilibrium point because of random processes and the population 
is returned to the equilibrium only by selection. So the load is not 0. 

Linkage can reduce the segregation load by holding together a group of 
heterotic genes, at least under some circumstances. An extreme example is 
the case where there are several loci, each occupied by two alleles, and with 
both homozygotes lethal at every locus. Under this model the segregation 
load is 1/2 at each locus, and for n loci will be 1 — .5". However, if all the 
loci were linked into two chromosomes, complementary at each locus, the 
load would be reduced to 1/2. 

Sved, Reed, and Bodmer (1967), King (1967), and Milkman (1967) have 
all suggested that a threshold or truncation-selection model can greatly 
decrease the segregation load. The model assumes that beyond a certain level 
of heterozygosity additional heterozygous loci make no increased contri- 
bution to the average fitness. In an extreme form, all individuals beyond a 
number x of heterozygous loci have fitness 1 and those below this number 
have fitness 0. An alternative model with much the same consequences is 
that a certain fraction p of the individuals are selected and the rest are 
culled. Those individuals that are selected are the ones with thelargest number 
of heterozygous loci. 

It is doubtful, we think, that natural selection acts by counting the 
number of heterozygous loci and then sharply dividing the population into 
two groups based on the number of such loci. But the animal breeder prac- 
tices truncation selection with respect to phenotypes, and it may be argued 
that natural selection approximates this pattern sufficiently well to alter 
substantially the number ef polymorphisms maintained by a certain ameunt 
of selectien. The question is one that requires empirical answers; clearly not 
enough is known about gene interactions to judge the realism of such models 
at present. 

The introduction of linkage into threshold models leads to mathematical 
difficulties, but computer simulations have shown that it is possible to devise 
systems in which a great amount of polymorphism is maintained. Wills, 
Crenshaw, and Vitale (1969) have studied one such model. They assume 
truncation selection of a certain intensity (e.g., 10% selective elimination, 
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where eliminated individuals are those with the smallest number of hetero- 
zygous loci). With close linkage the population tends to retain particular 
chromosomes—generally those that are mutually complementary—and a 
moderate amount of selection can retain a large number of polymorphisms. 
Again, how realistic the assumptions are is unknown. 

Which of these mechanisms are the more important in determining 
natural polymorphisms and whether additional mechanisms play a role are 
among the most intriguing questions of population genetics. The combined 
efforts of experimental studies, natural population censuses, and computer 
simulation studies will probably be required for any real understanding. 

The inbred load for a locus with k alleles is 


L, -Y PiSii 


6.12.3.13 
< kL, 


since Y p,s; is the sum of k terms, each of which (by 6.12.3.10) is less than 
the random-mating load. If all heterozygotes are the same the inbred load 
is simply k times the random load. 

Thus, for a locus maintained by selective balance, or for a group of 
such loci if they are independent, the inbred load is not greater than the 
random load multiplied by the number of alleles maintained in the equilibrium 
population. In principle, it should be possible to determine whether inbreed- 
ing effects in man are due mainly to mutationally maintained loci or loci 
maintained by selective balance—provided that the average dominance, Å, 
and the number of alleles, k, are both small. But the uncertainty about these 
and other assumptions and the absence of reproducible data have kept this 
from being an informative approach to the problem. 


6.12.4 The Incompatibility Load 


The only well-understood cause of an incompatibility load is maternal-fetal 
incompatibility for antigenic factors. For example, an A child with an O 
mother has a certain risk of dying as an embryo or neonatally due to anti-A 
agglutinins of the mother. 

Because of the rule that an individual can produce antibodies only 
against antigens he does not possess, and because antigens are (with perhaps 
a few exceptions) the result of dominant genes, it follows that any increased 
death rate will always be in heterozygotes. 

We can write the possibly incompatible types by writing maternal geno- 
types and the allele contributed to the embryo by the father. With random 
mating, they occur with the following frequencies, letting p, g, and r stand 
for the frequencies of A, B, and O alleles. 
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PROB- 
MOTHERS FREQUENCY SPERM FREQUENCY ABILITY 
GENOTYPE (1) GENOTYPE (2) (1) x (2) OF DEATH 
oo r? A p pr? d, 
00 r B à q? d, 
AA p? B q p?4 ds 
AO 2pr B q 2prq ds 
BB q? A p pq? d, 
BO 2qr A p 2pqr da 
The total incompatibility load due to this locus is equal to 
L=d,(pr? + pq?  2pqr) + ds(qr^ + p°g + 2pqr) 
6.12.4.1 


= d, p(1 — p)!  dgq(1 — qY. 


More generally, if p; represents the frequency of A; and d; is the prob- 
ability of death due to the antigen resulting from allele 4;, we can write 
the load as 


L=} d;pi(l — pi, 6.12.4.2 


where d; may, of course, be 0 for some alleles as is probably the case for O 
in the ABO system. 

This assumes that d, is the same irrespective of the mother's genotype 
(assuming she has no 4, allele) and of the other (non-A,) allele in the child, 
which is a reasonable a priori assumption, but not always true. If it is not, a 
separate d for each maternal-fetal genotype combination has to be introduced. 

If either the mother or the child is inbred the incompatibility load is 
changed. 


1. Mother Inbred The load due to the antigen produced by allele 4 
is now 


áp = PR + Ind. Ps] 
= d;i pE — PY — f.) + fall — Pid 
= di p(l — p)(1 — pi + Pi fm)» 


where f,, is the inbreeding coefficient of the mother. 
Summed over all alleles, this is 


D= > d; p(l — pC — pit pif). 6.12.4.3 


Thus the incompatibility load increases in proportion to the inbreeding 
coefficient of the mother. 
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2. Child Inbred Consider the diagram in Figure 6.12.4.1 where M is the 
mother and C is the child. The letters a and b are the two gametes that united 
to form the mother while c is the gamete contributed to the child by the 
father, F. 


Figure 6.12.4.13. A 
diagram of gametic 
connections between 
mother (M), father 
(F), and child (C). 
The solid circles are 
gametes, designated 
by a, b, and c. 


Let E be the probability that c is identical to either a or b, in which 
cáse there can be no incompatibility. Then, letting — stand for identity by 
descent, 


E = prob (a = c) + prob (b = c) — prob (a =b = c). 


But the probability that a = c is twice the contribution to the inbreeding 
coefficient of the child made by the path going through the grandparent that 
contributed a, and likewise for the probability that b = c. Therefore 


E = 2f, — prob (a=b — c), 6.12.4.4 


where f, is the inbreeding coefficient of the child. 

The prob (a =b = c) is 0, unless there is an ancestor to which all three 
gametes, a, b, and c, can be traced back. There seems to be no simple general 
rule for computing this, but it is easily done for individual cases. In most 
human pedigrees this term is 0, of course. 

The incompatibility load is then 


L= D(l1 — E). 6.12.4.5 
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Hence, except for the possibility of a common ancestor of a, b, and c, we 
can say that the incompatibility load increases in proportion to the inbreeding 
coefficient of the mother and decreases in proportion to the inbreeding co- 
efficient of the child. For further discussion, see Crow and Morton (1960). 


6.12.5 The Load Due to Meiotic Drive 


As the final example of a genetic load, consider the effect of meiotic drive 
(Sandler and Novitski, 1957). Typical examples are the t-alleles in mice and 
the SD factor in Drosophila. We shall consider only a simple example. 

Assume that the homozygote is lethal or sterile (which is often true). 
Let h be the selection against heterozygotes. Assume further that the ratio 
of a to A genes contributed by males is K: 1 — K, but that the contribution 
from heterozygous females is the regular 1 : 1 of Mendelian heredity. Meiotic 
drive is typically found in one sex only, so this is a good assumption. 

The genotypes and fitnesses can be designated as 


GENOTYPE AA Aa aa 
FITNESS 1 1—A 0 6.12.5.1 
FREQUENCY Ps Pm PsQm + Pods Qn 


where p, and p,, are the frequency of allele 4 in the gametes of females and 
males; qm = 1 — Pm and qp = 1 — pp. With meiotic drive we cannot make 
the simplifying assumption of Hardy-Weinberg zygote ratios, since the 
gamete frequencies are different in the two sexes (recall Section 2.5). 

The allele frequencies next generation are given by 


; K(1 — h)(p, dy + Pr dm) 


‘= Se ee MN 6.12.5.2 
1 —A(Dnds + Pr Gm) — Im Is 
qy = 2. 6.12.5.3 


To specify the equilibrium conditions, let qm = 4, = 4, and q; =q; =ĝ; = 
4n/2K. This leads immediately to the quadratic 


q2(1 — 2h) -Q,LA(1 + 4K) — 2K] + K[2K — 1 — (12-2K)] 20 6125.4 


or 

Ag? + BG, + C = 0, 6.12.5.5 
where 

A=1-—2h 


B=h(1 + 4K) — 2K, 
C = K[2K — 1 — h(1 + 2K)]. 
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The relevant solution is 


| —B- | B! — A4C 


Qn = 24 6.12.5.6 
The load is 

: hå m Q2 
L= (5,0, + By Am) + Am 4s = 57 (1 + 2K — 24) 5r. 6.12.5.7 


The value of the meiotic drive load for several representative values of 
K and h are given in Table 6.12.5.1. 


Table 6.12.5.1. The load due to meiotic drive. The gene favored by segregation 
distortion is assumed to be lethal when homozygous and to have a disadvantage 
of A relative to the normal homozygote. K is the proportion of gametes with the 
driven gene in one sex; the other sex is assumed to have normal Mendelian 
segregation. (From Crow, 1970.) 


h 
K ———— at ÓÀ———— ——— 
0.00 0.01 0.02 0.05 0.10 0.20 0.30 0.50 
i3 0 0 0 0 0 0 0 0 
.6 .010 .010 .010 .007 0 0 0 0 
7 042 042 .041 .039 .029 0 0 0 
8 .100 .100 .100 .097 .087 .033 0 0 
9 .200 .200 .200 .197 .187 .129 0 0 
95 .282 .282 .282 .279 .267 .205 .031 0 
.98 .360 .360 .359 .356 .342 .270 .083 0 
.99 .401 .400 .400 .396 .379 .298 .103 0 
1.00 .500 .495 .490 .472 .438 .333 .125 0 


When A = 0, as shown by Bruck (1957), 


îm = K —./K(1 — K), 6.12.5.8 
b= 3 VKU — K). 6.12.5.9 


This system has the interesting and unusual property that the load is 
decreased when the lethal gene is partially dominant. This property is brought 
out in the table, which also gives an idea of the range of values for K and h 
that maintain the polymorphism. 
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6.13 Evolutionary Advantages of 
Mendelian Inheritance 


The ubiquity of Mendelian inheritance attests to its evolutionary value. For 
such an elaborate mechanism to be contrived implies that it must confer a 
great advantage on the population possessing it. 

It is not at all apparent that sexual reproduction is of any selective 
advantage to the individual. Its value clearly lies in gene-shuffling, the value 
of which is more likely to be for the long time benefit of the population as a 
whole than for the individual. In fact, as we noted before, Fisher (1930, 1958) 
went so far as to suggest that sexual reproduction is perhaps the only trait 
that has evolved by intergroup competition. 

We shall not try to discuss ways in which Mendelian inheritance may 
have evolved, but we shall discuss briefly some of the ideas about its value 
to the population. 

It is true that the Mendelian system is capable of producing an enormous 
number of genotypes by recombination of a relatively small number of genes. 
The number of potential combinations is indeed great, but the number 
produced in any single generation is limited by the population size, and gene 
combinations are broken up by recombination just as effectively as they are 
produced by it. Furthermore, for a given amount of variability, the efficiency 
of selection is greater in an asexual population, for here the rate of progress 
is determined by the genotypic variance rather than by the genic variance, 
as it is in a sexual system. However, if the environment changes so that 
drastic change in phenotype is needed, an asexual population (in the absence 
of new mutants) is limited to the best genotype in the current population. 
Selection of Mendelian recombinants can produce strains that far transgress 
the former variability of the population. Numerous selection experiments 
have demonstrated that in a few generations the mean can come to lie outside 
the range of what were the most extreme deviants in the population before 
selection began. 

The evolutionary advantage of recombination has often been discussed. 
The two principal ideas are: (1) that recombination makes it possible for 
favorable mutants that arose in different individuals to get into a single 
individual, and (2) that recombination permits the species to respond more 
effectively to an ever-changing environment. The first idea was developed 
originally by Fisher (1930) and by Muller (1932). The second was most 
clearly stated among early writers by Wright (1931 and later) and bySturtevant 
and Mather (1938). 

To consider the first idea, we shall rely on a model first proposed by 
Muller. In an asexual population two beneficial mutants can be incorporated 
into the population only if the second occurs in a descendant of the individual 
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in which the first mutation took place. The limiting factor will be the time 
required for the descendants of the first mutant to increase to such numbers 
that a second mutant becomes reasonably probable. In a Mendelian popu- 
lation all the favorable mutants that occur during this interval can be incor- 
porated. (We are ignoring the loss of a new mutant by random processes, to 
be discussed later in Chapter 8, for this is not essentially different in the two 
systems.) An asexual system will be as efficient as a sexual system only if the 
mutation rate is so low, the selective advantage of the mutant so great, or the 
population so small that the first mutant is established before another favor- 
able mutant occurs. 

We have discussed this situation quantitatively (Crow and Kimura, 1965) 
but we shall give only a qualitative summary here. The situation is illustrated 
in Figure 6.13.1, which is adapted from Muller’s 1932 paper. The three 
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Figure 6.13.1. Evolution in sexual and asexual populations. For 
explanation, see the text. 
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mutants, A, B, and C, are all beneficial. In the asexual population when all 
three arise at roughly the same time only one can prevail. In this case A is 
more fit than B or C; or A may simply be luckier in happening to occur in 
an individual that was for other reasons unusually well adapted. B can be 
incorporated only when it occurs in an individual that already has mutant A, 
and this will not happen on the average until the descendants of the original 
A mutant have grown to numbers roughly the reciprocal of the mutation 
rate. C is finally incorporated, but only after AB individuals are in appreciable 
numbers. So, in an asexual population the mutants are incorporated in series. 

The sexual population, on the other hand, permits the incorporation of 
mutants in parallel. The ratio of the evolutionary rate in a Mendelian to 
that in an asexual population is the number of mutants that occur in the 
interval of time between the occurrence of a mutant and the occurrence of a 
second mutant in a descendant of the first mutant. Numerical calculations 
have been presented in the paper referred to above. 

The qualitative conclusions are clear from the diagrams however. The 
relative advantage of sexuality will be greatest when the population is large, 
when the mutation rate is high, and when the selective advantage of the 
mutant is small—for each of these favors the occurrence of more mutants 
than can be incorporated in series. In other words, the advantage of sexuality 
is greatest when the system is evolving by very minute steps in a large 
population. 

On the other hand, the incorporation of new mutant is not the only 
evolutionary factor of importance. Evolution may also occur by the shifting 
of frequencies of genes that are already present in fairly high frequencies in 
the population, as Wright has emphasized. In this case the argument of 
Sturtevant and Mather is relevant. We can do no better than quote directly 
from their article: 


The simplest system that we have been able to devise, having the required 
property of favoring recombination, is as follows: Two gene pairs, A, a and 
B, b, exist in a population subjected frequently to three different sets of en- 
vironmental conditions, D, E, and F. Condition D favors A but acts un- 
favorably on B, whereas E favors B but lowers the frequency of A. These, if 
properly adjusted as to intensity of selection and number of generations over 
which they operate, will insure the perpetuation of both allelomorphs at each 
locus. Recombination will not, however, be of importance. For this it seems 
necessary to introduce the third condition, F, favoring the combination AB 
(with or without a similar effect on ab), but acting adversely on the single 
types A and B. Under such conditions it would appear that there will be a 
selective action favoring recombination, of the order of magnitude of the 
selection of the AB type, as opposed to Ab or aB, under condition F. Other 
combinations of two or more loci may be exerting similar action in similar 
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or disimilar environmental conditions, and so the net effect will probably be 
to favor recombination for the majority of loci under the majority of condition 
changes. 


Similar arguments apply when the population is divided into partially 
isolated subpopulations with different gene combinations favored in different 
subpopulations and occasional migration between subpopulations (Wright, 
1931 and later). 

Virtually the same argument has been stated recently (and independently) 
by Maynard Smith (1968). We do not know which of these arguments, the 
Fisher-Muller or the Wright-Sturtevant, is the more important. Very likely, 
both are valid, and which is more important depends on whether the incor- 
poration of new mutants or the adaptation of existing variants to variable 
environments is more often the limiting factor in evolutionary advance. It is 
also possible, of course, that neither of these has identified the really crucial 
advantage, and that some third reason is still more important. 


The Advantages of Diploidy The evolutionary advantages of recombina- 
tion can be obtained in species that spend most of their lives as haploids as 
well as in those that are predominantly diploid. Yet there has often been 
evolution toward diploidy. What is the reason? 

At first glance it would appear that there is an obvious advantage of 
diploidy in that dominant alleles from one haploid set can prevent the 
expression of deleterious alleles in the other. However, as soon as a new 
equilibrium is reached the mutation load will be the same; in fact, it will be 
twice as large unless the mutants are completely recessive. From this stand- 
point diploidy is a disadvantage, not an advantage. 

However, when the population that is previously haploid suddenly 
changes to diploid there is an immediate advantage. To be sure, when the 
equilibrium is reached the advantage is gone; but by this time the diploid 
condition may be established and there is no way of going back to haploidy 
without all the deleterious effects of exposing recessive genes. So it may be 
that diploidy is not conferring a lasting benefit, but is the result of a tem- 
porary advantage that cannot easily be gotten rid of. 

There are two other possible advantages of diploidy. One very obvious 
one is overdominance. If such loci are common, there is a genuine advantage 
to diploidy provided such effects are important enough to compensate for 
the greater mutation load from partially dominant loci. 

A second possibility is the protection it affords from the effects of 
somatic mutation. The zygote in a diploid species or the gametophyte in a 
haploid plant may have approximately the same equilibrium fitness, but the 
effects of somatic mutation would be quite different. Diploidy would protect 
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against recessive, or partially recessive, somatic mutants. If the soma were 
large and complicated, as in higher plants and especially animals, a diploid 
soma may provide a significant protection against the effects of recessive 
mutants in critical cells, 


6.14 Problems 


l. 


N 


12. 


A mutant has a selective disadvantage As in the heterozygote. Assume 
that As is large enough that selection in mutant homozygotes can be 
ignored. Show that the mean number of generations that a mutant 
persists is 1/hs. 


. Assume that X-linked hemophilia has an incidence of one in 20,000 males 


and a selective disadvantage, s, of about 0.8. Estimate the mutation rate 
of the gene. 


. Give two or more reasons why the mutation rate estimates for autosomal 


dominant and X-linked recessive mutants are more accurate if s is large. 


. What is the genic variance of an overdominant locus at equilibrium 


under selection ? 


. Show that for equilibrium under selection the equilibrium gene frequen- 


cies remain the same when w;; is replaced by a + bw;;, where a and b 
are constants. 


. Show that the segregation load when f =1 for one of a series of multiple 


alleles is the same as the load for all alleles when f= 0. (Assume that all 
heterozygotes are equal in fitness.) 


. Prove that in equation 6.7.4, when w,, — 2w,; + w22 < 0, |AC/E| < 1. 
. Show, for the model of equation of 6.5.6, that the migration load is 


M(p — p)/p, where p is the equilibrium frequency in the subpopulation 
and p the gene frequency in the immigrants. Is the load the same when 
the favored gene is completely dominant? 


. Derive the characteristic equation of 6.9.6 by letting y'/y = y'/y' = A. 
10. 
11. 


Show how 6.12.2.8 may be obtained. 

What would be the effect of polygamy on the zygotic sex ratio? Of 
infanticide where one sex is preferentially killed? Would you expect a 
different zygotic sex ratio for mice than for monkeys? 

Assume that weight in mice is determined by a large number of loci, 
that there is no dominance or epistasis, and no environmental variance. 
Assume further that the fitness of a mouse is proportional to the square 
of its deviation from the average weight in the population. Show that 
the homozygous load is twice the randomly mating load. 
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13. Given the array of fitnesses 


A, A, As 

A, 090 1.00 1.15 
A, 100 100 1.10 
A, 1.15 LIO 0.80 


will this lead to a stable equilibrium? Is there a stable equilibrium if 
allele A; is lost? 

What frequency of the Rh" gene (d) will maximize the incidence of 
hemolytic disease in RA * embryos with RA^ mothers? 

What frequency of the A, B, and O alleles will maximize the ABO 
incompatibility load? Are these frequencies independent of d, and dp? 


iE; 
A FINITE 
POPULATION 





n preceding chapters we have considered mutation, migration, and selec- 

tion as factors causing deterministic changes in gene frequencies and in such 

population properties as average fitness or performance. There was the 
implicit assumption that the population is large enough that random sampling 
of gametes does not introduce an appreciable amount of noise into the system 
and that the migration and selection coefficients are either constant or change 
in a predictable way. 

Actual populations are finite so there is some random drift in gene fre- 
quencies as we mentioned briefly in Chapter 3. Also there may be fortuituous 
changes in the other factors, particularly in selection coefficients. 

In this chapter we will continue the discussion begun in Chapter 3 on 
the effects of a finite population number, ignoring the influence of migration 
and selection. Later, in Chapters 8 and 9, we consider how these factors 
interact with random processes. 
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7.1 Increase of Homozygosity 
Due to Random Gene Frequency Drift 


As was pointed out in Chapter 3, Section 11, there is a fluctuation in gene 
frequency from generation to generation in a finite population. We can 
regard each generation of N diploid offspring as being derived from a sample 
of 2N gametes from the parental generation. As time goes on the gene fre- 
quency will tend to depart more and more from its original value. Finally 
one of the alleles is fixed while all others are lost and the population becomes 
homozygous for this locus. Since the fixation of genes is an irreversible 
process (if we exclude mutation or migration) the number of fixed loci will 
tend to increase with time. 

We also emphasized that, although the consequences of random gene 
frequency drift and consanguineous mating are both such as to lead to an 
increase in homozygosity and we can measure either by Wright's inbreeding 
coefficient, in one regard the two are quite different. In a large population 
with some consanguineous matings there is a departure from Hardy-Weinberg 
ratios, as given by equations 3.2.2 and 3.2.3, and the proportion by which 
the heterozygosity is reduced is equal to f. In a finite population within which 
mating is at random the population remains in approximate Hardy-Weinberg 
ratios. The reduction in heterozygosity comes from the random changes in 
gene frequency, the net effect of which is to reduce Y' p;p; (the proportion of 
heterozygotes, i # j) by a fraction f. 

In a population of N diploid individuals mating completely at random, 
including the possibility of self-fertilization, the heterozygosity decreases at a 
rate 1/2N where N is the effective population number (see 3.11.2). In terms 
of the inbreeding coefficient 


l l 
fi= ant (1- zp- 74.1 


as was shown earlier (3.11.1). 
Also, when there is no self-fertilization the change is given by 


f= g-is 74.2 


1 N-1 
g= gy find + (A Jas 743 


where f, is the inbreeding coefficient in generation t and g, is the coefficient 
of consanguinity of two randomly chosen individuals in generation t (see 
3.11.3). 
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From this 
1 1 1 
fast (12 sete 7.1.4 


If there are two sexes of unequal numbers, N,, males and N, females, 


ya ANON 


= Na +N, 7.1.5 


in 7.1.4 (see 3.13.1). 

We also introduced Wright’s concept of effective population number 
in Section 3.13, an idea that will be developed further later in this chapter. 

Random gene frequency drift, either because of a finite population or 
because the selection and migration coefficients vary, plays a central role in 
the “shifting balance” theory of evolution proposed by Sewall Wright. As 
we discussed in Chapter 5, deterministic selection tends to increase the fre- 
quency of those genes which enhance the fitness of the individual. So the 
population fitness tends to increase (subject to some qualification if there are 
complicating factors such as frequency-dependent selection and linkage dis- 
equilibrium). This was discussed in Sections 5.6, 5.7, and 5.9. 

If the mean relative fitness is plotted as ordinate and the various gene 
frequencies as the various abscissae in multidimensional space, we can think 
of the fitness as a hypersurface. In this metaphor we can speak of the surface, 
as Wright has, as an adaptive surface. See Wright (1967 and earlier) for a 
discussion of this concept, and for the relation of the *'*existence" of such a 
surface to quasi-linkage equilibrium. If the surface has multiple peaks and 
valleys a population, which can be thought of as a point on the surface, will 
tend to climb the nearest peak, which is not necessarily the highest. There 
may be no deterministic way in which a population can change from one 
peak to a higher one. 

Wright suggests that such evolutionary bottlenecks may be frequent and 
that it is important that there be ways by which a population can move away 
from the stable equilibrium represented by one peak in order to come under 
the sphere of influence of another, higher peak. This might happen if, for 
any of the reasons mentioned before, there is some random shifting of gene 
frequencies. Such a change might be sufficient to permit the population to 
wander randomly over the surface (still speaking metaphorically) and come 
under the influence of a higher peak. Of course, there is a loss in average 
fitness as the population drifts from the highest point. But Wright argues that 
this is a necessary price for letting evolution find new gene combinations. 
Multiple peaks and valleys will be prevalent if there are complex dominance 
and epistatic relations among the various loci, especially those of a type 
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where two or more alleles that are individually deleterious are collectively 
beneficial. 

The views of Fisher and Wright contrast strongly on the evolutionary 
significance of random changes in the population. Whereas, to Fisher, random 
change is essentially noise in the system that renders the deterministic proc- 
esses somewhat less efficient than they would otherwise be, Wright thinks 
of such random fluctuations as one aspect whereby evolutionary novelty can 
come about by permitting novel gene combinations. 

Whether random gene frequency drift is a way of creating new, favorable 
epistatic combinations or is more like background noise, there is increasing 
evidence that it is prevalent. Molecular biology has shown dramatically the 
wide range of mutational possibility at a single gene locus with its several 
hundred nucleotides. The possibility that many nucleotide substitutions may 
cause inconsequential changes has become increasingly apparent. Some, or 
perhaps many, such changes may alter fitness by an amount that is of the 
same order as the mutation rate or the reciprocal of the effective population 
number, or less. The fate of such mutants is determined largely by random 
processes. The possibility that such random changes may account for a sub- 
stantial part of the amino acid changes observed in the evolution of hemo- 
globin and other proteins has been discussed by several authors recently 
(see Kimura, 1968, 1969b; King and Jukes, 1969; Crow, 1969). In the next 
chapter we shall discuss the rate of evolution that would be expected from 
neutral or near-neutral mutations. 

The other aspect of near-neutral genes and a great multiplicity of poten- 
tial mutations at each locus is the possibility that this may account for poly- 
morphisms, particularly those having no overt effect and detected only by 
electrophoresis or other chemical trickery. This is the subject of the next 
section. 


7.2 Amount of Heterozygosity and 
Effective Number of Neutral Alleles 
in a Finite Population 


It has often been suggested in the past that the wild-type allele is not a single 
entity, but rather a population of different isoalleles that are indistinguishable 
by any ordinary procedure. Since each gene consists of several hundred, or 
perhaps thousands, of nucleotide pairs, the range of mutational possibility 
is enormous, especially when one considers combinations. That some of 
these are essentially equivalent seems reasonable and is reinforced by chemical 
studies showing that amino acid substitutions often do not affect in any 
detectable way the function of certain enzymes. In any case the probability 
that such alleles may exist in substantial numbers seems great enough to 
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warrant an inquiry into the population consequences. The procedure given 
here is an extension of one presented earlier (Kimura and Crow, 1964). 

In this chapter we shall consider only neutral alleles. Those with a slight 
selective advantage or disadvantage (especially those that are overdominant) 
are of interest, but require more advanced methods. These will be discussed 
in Chapter 9. 

To isolate the essential question, we consider an extreme situation in 
which the number of possible isoallelic states at a locus is large enough that 
each new mutant is of a type not preexisting in the population. This provides 
an estimate of the upper limit for the number of different alleles actually 
maintained by mutation. 

Let u be the average rate of mutation of the alleles existing in the popu- 
lation of effective size N,. If the population consists of N individuals, there 
will be 2Nu new mutant genes introduced per generation. The probability of 
two uniting gametes carrying identical alleles is given by 7.1.1. The two genes 
which are identical in state will remain so in the next generation only if 
neither of them has mutated since the previous generation, the probability 
of which is (1 — u)?. Thus we can write 


1 1 
f= I * (i -zy Ja — u)’. 7.2.1 
When equilibrium is reached, f, = f,-,. The solution is 
2 
DERE D NN "i 
2N, — QN, — \)(1 — u) 
Ignoring terms containing u?, we obtain 
1 — 2u 1 
f = ————— 2 m. 7.2.3 
4N,u—2u+1 4N.u+1 
The proportion of heterozygous loci, H, is | — f or 
4N, u 
H x ————. 7.2.4 
4N,u cl 


This gives the probability that an individual chosen at random will be 
heterozygous for this locus. If the effective population number is much 
smaller than the reciprocal of 4u, the average individual will be homozygous 
for most such loci; on the other hand, if N, is much larger than 1/4u the 
individuals in population will be largely heterozygous for neutral alleles. 

It is convenient to define the effective number of alleles (n,) maintained 
in the population by the reciprocal of the sum of the squares of allelic 
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frequencies. In the present model, n, = 1/f (since the proportion of homo- 
zygotes = f = )' pi) and therefore 


n, — AN,u + I. 7.2.5 


Note that this will be less than the actual number of alleles, unless all alleles 
are of the same frequency. 

If p; is the frequency of the ith allele, then the effective number of alleles 
is 1/)° p7. On the other hand, the actual number of alleles is 1/p, where p is 
the mean allele frequency. For example, with three alleles with frequencies 
2/3, 1/6, and 1/6, p = 1/3 (of course, since there are three alleles), but 
Y, p? = 1/2. Thus the number of alleles is three, but the amount of hetero- 
zygosity is the same as would be found in a population with two alleles of 
equal frequency. 

Most of the time in population genetics we are more interested in the 
effective number of alleles than in the actual number. Many of the alleles 
will be represented only once or twice in the population and contribute very 
little to the average heterozygosity or genetic variance. Since it is ordinarily 
the latter quantities that we are interested in, the effective number is the 
more useful. Fortunately, the effective number is much more easily computed, 
as we have just shown. The actual number requires a knowledge of the dis- 
tribution of allele frequencies; this will be deferred until Chapter 9. 

The derivation of 7.2.3 assumed that each mutant is new; that is, that 
it is an allele not currently represented in the population. We can remove 
this restriction. Assume that there are k possiblealleles and therate of mutation 
from one to one of the k — | others is u/(k — 1); that is, the total mutation 
rate is u. Then we modify 7.2.1 to become 


f= [t era eje o 


" ( 7 x) -4-» D — u), 





ignoring the possibility that both alleles mutate simultaneously to the same 
new mutant state. 

Ignoring terms in wv? and letting f, = f,.., = f, we get for the equilibrium 
value, 


— 





anu E ] +1 
Í = oo 7.2.7 


4 1 
Noulz = 7 + 


Ls 
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or 


4N, 
D NEAL NN 728 


k 
4N 
(4) e 





As expected, when k — oo, 7.2.8 becomes the same as 7.2.4. 

We have been assuming where the population includes self-fertilization. 
We can readily remove this restriction. Including the possibility of mutation 
and returning to our model where each mutant is new to the population, 7.1.2 
and 7.1.3 are modified to become 


A -9,-(1— u)?, 72.9 
1 


1 
g = Ex tft (1 — x.Je-.]e — uy. 7.2.10 


Putting these together and eliminating the g’s gives 


1 


f= 3N. 





(1+ f,-2)(1 — u)* + ( TRES — uy. 7.2.11 


1 
Pas 
Ne 


Table 7.2.1. The average proportion of homozygosity, f (upper figure), and the 
effective number of neutral alleles (lower figure) in a randomly mating population 
of effective number N. and mutation rate u. (From Kimura and Crow, 1964.) 


EFFECTIVE POPULATION NUMBER, Ne 


MUTATION RATE, u 10? 10? 10* 105 105 10’ 
10-+ .96 71 .20 .024 .0025 .00025 
1.04 1.4 5.0 41 401 4001 
10-5 .996 .96 .71 .20 .024 0025 
1.004 1.04 1.4 5.0 41 401 
10-5 .9996 .996 .96 .71 .20 .024 
1.0004 1.004 1.04 1.4 5.0 41 
10-7 .99996 .9996 .996 .96 71 20 


1.00004 1.0004 1.004 1.04 1.4 5.0 
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At equilibrium, f, = fit-1 =/;-2 =f, leading to 


l 
A LL 7.2.12 
f 4N,u+ 1 
and 
4N,u 
z ———_., 7.2.13 
4N,u+ 1 


just as before. 


Ne f 

1 1 

2 0.5 

5 0.2 
10 0.1 
204 0.05 
504 0.02 
100 0.01 
2004 0.005 
5004 0.002 
1000- 0.001 

10? 10° 104 105 10° 10? 


Ne 


Figure 7.2.1. Average homozygosity, f, and the effec- 
tive number of neutral alleles, ne, as a function of the 
effective population number, N,, for various mutation 
rates, u. The range of mutational possibility is assumed 
to be large enough that each mutant is an allele not 
currently in the population. Populations where / is 
between 0.5 and 1 are homozygous for the majority 

of loci; those below 0.5 are heterozygous for the 
majority. 
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Table 7.2.1 gives the average proportion of homozygosity, f, and the 
effective number of alleles, n, — 1/f, for populations of various effective 
number and mutation rate. If 4N,u — 1 the effective number of alleles is 2 
and increases as 4N,u gets larger. As 4N,u gets smaller the population 
becomes more nearly monomorphic. For populations with effective numbers 
of order 10? or larger, it should not be surprising to find considerable neutral 
polymorphism, provided that neutral mutations are occurring at rates of the 
order 10^? cz higher. This is shown graphically in Figure 7.2.1. 

Later in this chapter we shall discuss effective population number in 
more detail. At that time we will show that there are two ways of defining 
the effective population number. For many cases these lead to the same con- 
sequences, but not always. To anticipate, the effective number that we have 
been discussing in this section is the inbreeding effective number. 


7.3 Change of Mean and Variance in 
Gene Frequency Due to Random Drift 


Consider a population of N breeding individuals in which the frequencies of a 
pair of alleles A, and A, are p and 1 — p respectively. We assume that mating 
is such that the next generation is produced from union of N male and N 
female gametes, each extracted as a random sample from an infinitely large 
hypothetical population of gametes produced by the parents. Thus the prob- 
abilities that the number of gene A, will be 0, 1, 2, ..., j, ..., 2N in the pooled 
sample of 2N gametes is given by 


pint, 2NQN - 


1 
(1 — p?" 2Np(1 — 3 A - press 


2N(2N —1):-- QN —i tl) . = 
( ) ) ia — py^ Foose, p. 


In other words, the probabilities follow the binomial distribution which is 
obtained by expanding 


[p * (1 - p". 


Since the mean and variance of this distribution are respectively 2Np and 
2Np(1 — p), the mean and variance of the gene frequency x will be given by 


x = E(x) = p, 7.31 


V, = E[(x — p] = E(x?) - p! = = M 


where E stands for the operation of taking expectations with respect to the 
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number (i) of gene 4, and x = i/2N. Fora derivation of the binomial variance 
formula, p(1 — p)/2N, see A.5.3-A.5.5. 

In the previous chapters we used the letter p, or sometimes q, to designate 
allele frequencies. Now, as we shift the emphasis to random effects and 
stochastic processes, we shall designate the allele frequency by x to emphasize 
that this is a random variable. Usually, we use p for the initial value, which 
is what the frequency would be in the absence of random effects. 

We will call the newly formed generation the first generation. Formulae 
7.3.1 and 7.3.2 show that in going to the first generation, the mean of the 
gene frequency is unchanged, while the variance increases by an amount 
inversely proportional to the population number N. Suppose that the second 
generation is produced again from N male and N female gametes taken as 
random samples from the first generation. Let x' be the frequency of A, in 
the second generation; then 


x = E(x’) = E,E (x) = E[EY)] = E(x) = p, 


where E, and E; designate expected value operators in the first and the 
second generation. It will be seen that the mean is again unchanged. Similarly, 
the variance is calculated as follows: Since 


Vy = E[G' — p?] = E, E [x — x) + (x - D 
= EE[(x — x)?] + 2E,[(x — pEx' — x)] + E [x — p]. 
in which 
Ej[(x' — x)*] = 
and 
E(x — x) = 0, 


we obtain 


1— 1 
rS = gg lx- P -6- 75) 











1f » r1-p],ra-p 
Ve = ON | E 2N |+ 2N 
_ Pl — p) (1-5) +S”. 
2N 2N 2N 


If the same pattern of reproduction is continued until the tth generation, 
the mean and the variance become respectively 


x) = P, 


1 — p) 1-13 1 
eee) n 
$ ON 1 2N + +1 3N +1 


= p(l -p| - (1 - =x) | 


7.3.3 


7.3.4 


PROPERTIES OF A FINITE POPULATION 329 


After a large number of generations, the mean is still unchanged, i.e., 


g% = P, 7.3.5 
but the variance increases with time and finally becomes 

yi» = p(l — p), 7.3.6 
since 


: 1 \' 

RE ( | " 
This suggests that at this limit, the gene frequency will become either | or 0 
with probabilities p and 1 — p. The process leading to this state starting from 
an arbitrary gene frequency will be studied in detail in Chapter 8. The state 
x = ] corresponds to fixation of gene A, in the population and x = 0 corre- 
sponds to its loss. The increase of variance with time as shown in 7.3.4 shows 
that the probability of belonging to one of these states increases with time. 

We are referring here to the variance in gene frequency between a group 
of populations, each starting with the same gene frequency. This variance 
increases with time as the populations diverge from their initial value. At 
the same time the average genic variance within each population decreases 
as the individual gene frequencies move toward 0 or 1. In parallel with this 
the average heterozygosity of an individual decreases with time, as we shall 
now show. 

Let H, be the probability that a randomly chosen individual in the 
population at the fth generation be heterozygous. Since this probability is 
2x(1 — x) for a given gene frequency x, we have 


H, = E[2x(1 — x)] = 2E[x — (X? — p°) - p°] 
= 2[x0 — VP — p°], 


which reduces to 


2N 


if we apply 7.3.3 and 7.3.4. 

We see that the average heterozygosity decreases at a constant rate of 
1/2N per generation. If the initial population (generation 0) is produced by 
random mating, the expected frequency of heterozygotes at that time is 
2p(1 — p), so we have 


BU 
H, = 2p(1 — p)(1 — x) " (I1 2. s), 7.3.7 


2N 


as we obtained earlier by another method (see 3.11.2). 


1 f 
H, = ni — Tl (t —0,1,2,...), 7.3.8 
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When N is sufficiently large, this is approximated by 


H,= H,e-3N = 2p(1-— p)e 3s 7.3.9 


(see also formula 8.4.11 in the next chapter). 

At this point, a few remarks are in order. First it should be noted that 
the homozygosity or heterozygosity of an individual within a population is 
a concept distinct from genetic homogeneity or heterogeneity of the popu- 
lation itself. For the latter Wright used the terms homallelic and heterallelic; 
a population is homallelic if it contains only one allele, but is heterallelic if 
it contains two or more alleles. Secondly, as shown above, the probability 
of heterozygosity decreases at the rate of exactly 1/2N per generation under 
random mating, while the probability of coexistence of both alleles within a 
population, though it diminishes each generation, does not decrease at a 
constant rate. Its rate approaches 1/2N only asymptotically. This will be 
demonstrated at the end of the next section and in more detail in Chapter 8. 

Wright (1931) gave an approximate formula, 


t 
L, = Loe aN, 7.3.10 


for the number of unfixed loci at the tth generation. The approximation is 
satisfactory only for a large t and an intermediate initial gene frequency. 
The number of alleles introduces another complication into the process of 
decrease in genetic heterogeneity, but not in the decrease of heterozygosity: 
Formulae 7.3.8 and 7.3.9 are correct not only for two alleles but for any 
number. 

Let x,, X2, ..., x, be respectively the frequencies of alleles A,, A,,..., A, 
in the random-mating population at the tth generation (x, + x; t ^^ t X, 
— ]). The distribution of the frequencies of these alleles in the next generation 
produced by N male and N female gametes taken as random samples from 
generation t follows the multinomial distribution with means, variances, and 
covariances given by 





E(x;) = xi, 7.3.11 
(=x) 
E ere AE) ae vas 
(xi) Xi 2N 7.3.12 
ror XiX; $ 5 
E(xixj) - xix; = — I (i # j). 7.3.13 


The expected total frequency of heterozygotes in generation t + | is 


Hs E( 2 xixj) = 2 EGUXD 


i*J 
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and if we apply 7.3.13 in the right side of the above expression, we obtain 
1 
Ha, = (1 > x)Ls 
Or 
1 
H, = |1 —3N H,. 2.3.14 


The expected total frequency of heterozygotes in a randomly mating popu- 
lation decreases at the rate of 1/2N, regardless of the number of alleles or the 
allele frequencies. If we note that the expected frequency of heterozygotes is 
proportional to 1 — f, (see Chapter 3), we obtain 


1 
1- fer (1 ga - o 
or 


1 1 
= — 1-— — 
fi IN x)^ 2.3.15 


which is equivalent to equation 7.1.1 and also 3.11.1. The above relation may 
also be written as 


Af,  fsi-h 1d 


= OSS SS BÓ 7.3.16 


Starting from J = 0, the inbreeding coefficient in generation t is given by 


LY yt 
4-1-(1- x) eK ES 7.3.17 
2N/  p(l— p) 
When N is large we have, with good approximation, 
t 
f,=1—e 7 7.3.18 


7.4 Change of Gene Frequency Moments with Random Drift 


This section is an extension of the previous one. Here we will study more 
generally the change of gene frequency moments. First, a discrete model will 
be employed to derive a few moments and the results will be compared with 
the corresponding values obtained by a continuous model. The latter model 
will then be used with advantage to derive a general formula for the moments 
when N is large. Some of the results obtained will be found useful in later 
sections. In the next section, we will find that the first four moments are 
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required to describe the change of variance in quantitative characters in 
finite populations. Furthermore, by knowing all the moments for large N, 
we get extensive knowledge of the process of random genetic drift which 
will be treated in detail in Chapter 8, Section 4. Actually, it was through this 
method in conjunction with the method of partial differential equations that 
the entire process of random genetic drift was first constructed (Kimura, 
1955a). 

Let us consider an isolated population of N breeding diploid individuals. 
Let A, and A, be a pair of alleles with frequencies x and 1 — x respectively. 
We assume that mating is random and that the mode of reproduction is such 
that N male gametes and N female gametes are drawn as a random sample 
from the population to form the next generation. Here x may take any one 
of the sequence of values: 


1 2 1 
"2N'2N" 2N’ 
We designate by f(x; t) the propability that the gene frequency is x at the 
tth generation (t =0, 1,2, ...). Let x, be the frequency of A, in the tth 


generation and let óx, be the change of x, due to random sampling of gametes 
such that 


0 1. 


X,44 = X, T Óx,. 7.4.1 
Let 

uy) = E(x? 1) 7.4.2 
be the nth moment of the gene frequency distribution about 0 in the genera- 
tion t+ 1. Conventionally the nth moment around 0 is designated by p}, 
but since we often use the prime (’) to designate values in the next generation, 
we will save the prime for this purpose. 

Now, we note that the distribution in the (t + 1)th generation is the 
result of convolution of the distribution in the tth generation and the sampling 
"error" in reproduction. Thus, in calculating the expectation of x?,, in 
terms of (x, + 6x,)", we take the expectation in two steps: first taking the 
expectation for a random change 6x, which we shall denote by E;, and then 
taking the expectation for the existing distribution of gene frequency which 
we shall denote by £,, such that 


1 
E(x) = Y. x"f(xi t) = ui? 743 


x=0 
is DAE ME "/(2N\ , ; 
Edi 3. (zs 7) C. Ja - ts ras 


since ôx, can be assumed to follow the binomial distribution. 
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For n = 1, 2, 3, and 4, 7.4.4 becomes 


E,(6x,) = 0, 7.4.5 

Ex?) =“ 4s 

E,[(5x)?] Um dn 
and 

EJ (Gx*] = 3x; (1 — x,) x,(1 — x) (1 — 6x, + 6x;) i 


(2N)? (2N)° 


The mean, variance, and higher moments of the distribution can be obtained 
as follows. 
From 7.4.1, 


E(x,41) = E(x, + óx)). 
The left side is p * by 7.4.2, while the right side is 
E(x, + 6x,) = Eglx, + E(0x)] = Eg(x,) = #1, 
since E,(6x,) = 0 from 7.4.5. Therefore 


+ 0 
ue 1)— pP = = po, ) 


i.e., the mean stays constant: 


ae 


py = =p 7.4.8 


To obtain the second moment we square both sides of 7.4.1 and take expec- 
tations. 


E(x/.1) = E[x? + 2x, 5x, + (6x,)’]. 
The left side is p$*! by definition, while the right side is 


E,[x? + 2x, Ej(0x,) + E,(5x,)7] = Ey [x + xü CON 2), 


which becomes 
1 1 1 1 
1— E xE ; — — |p + — 4m, 
(1 - 555) Bota) + zs Ee = (1 - zu zit 


by noting 7.4.5, 7.4.6, as well as 7.4.3. Since p{ = p®, the above equation 
gives 


1 1 
uet 1) _ -(1 M: x) dé aN uo. 7.4.10 
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The solution of this finite difference equation is 
( ( 1 
) 0) _ Ofi — — + 9) 7.4.11 
u$ = (P — pi X 5| u$ 
Let V €? be the variance of gene frequency in generation t, i.e., 
VP = pP — (PY; 
then 


1 t 
yo = HOU = py) + (ue? = ut» "T sx) . 7.4.12 


To find the third moment, we use a similar procedure; cubing both 
sides of 7.4.1 and taking expectations, 


E(xi41) = Elx? + 3xr Es (0x) + 3x, E(0x)) + Es(ôx,)*] 


b 3 3x,x(1—x) | x(1— x,)(1 — 2x,) 
app RUE 


where the last term is obtained from 7.4.7. Thus, we obtain the recurrence 
relation, 


1 3 1 1 2 
poss (x) HD + oy ( E xx) P4 ( B xx) (1 - ay) as 


the solution of which is 


3 1y 
(0 a (0) ~ 67,49) — ,,€0) NND 
p muc 5 (P BH (1 x) 


3 1 1 2 r 
(0). -,,40 (0) Pun re. ed 4. 
+ (us p «5 ax) =x) | TA 


Carrying out the same procedure, we obtain the following recurrence formula 
for the fourth moment: 


pit) = (= l x pO 47 x) = a) 
: 2N 2N 2N]? 
6 1 2 
et Ein — t 0,0 
+ 2N ( x) nes 7.4.16 


1 2 3 
int cer a) 
+( x) sx)! aj) 
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the solution of which is 


18N — 11 1V 
(1r) — ,,(0) (0) __ ,,(0) mt mms 
HP = WO + = (ul — (1 =) 


3 1 1 257 
(epeei) 
t (us phe +34 JU 75x) l1 - ay 


12N-7 4 2N-1 | 


7.4.16 








10N -6% ^ 10oN — 6! 


- [7636-321 


If we Mart from a population with gene frequency p, we have p‘° = p, 
(0) _ 


+ [i - 208 + 


pS = p?, etc., and the above results are expressed as follows: 
HY? = p, 7.4.17 
1 t 
uf = p— p(1— »(: - a] 7.418 
1 t 
V9 -pü- »|1- ( = x] | 7.4.19 
3 1 y 
DlÉp-cp(1-p)i- x) 
Hy = P— 5 Pl »( 2N 
1 í 353 7.4.20 
—-p(l—- —Dn|it-zit-z 
; PCL — p)(2p | aJ =) | ; 
18N — 11 2 y 
(no c — p1 — E 
Ha =P- Fon ae PU »(: xi) 
1 2 Y] 
— p — pX2p — 1)| |1- oi) oN 7.4.21 
2N —1 
+ p(l - pla —p)- exi 


[ose an) ml]: 


The result for variance agrees with 7.3.4; see also Wright (1942) and Crow 
(1954) for alternative derivations. The moment formulae are important in 
finding the process of change in the genotypic variance within lines, between 
lines, and the additive component of the variance within lines in the case of 
inbreeding due to restricted population size. This was first worked out for a 
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completely recessive gene by Robertson (1952). Though his expressions are 
less explicit, the present results seem to be in complete agreement with his, 
which were obtained by the use of matrix algebra. 

In order to find the general formula for the moments of the distribution, 
we shall now make the assumption that the population size N is large enough 
that terms of the order 1/N?. 1/N?, etc., can be omitted without serious error. 
The recurrence formula for the nth moment is obtained as follows: 


E(x7+1) = E[(x, + 6x,)"] 


= Ex + (stes + (5) tc? + 2 


where O(1/N?) denotes a term of the order of 1/N?. 
Neglecting higher order terms, we have 





ES, 
B= Í US |? t "ue Dy, (n=, 2,...). 7.4.22 


For a large N, the moments change very slowly with generations, and 
we can replace the above system of finite difference equations by the following 
system of differential equations. 


dy? mE 
Tex mg quexiatb. ny 7.4.23 





If the population starts from the gene frequency p, we get the following 
results for n = 1, 2, 3, and 4: 


(t — 


Hi =P, 7.4.24 
u$ =p- p(1— pe 2, 7.4.25 
VO = p(1— p - e), 7.4.26 
KP = p — 3p(1 — p 28' — 4p(1 — pp — Ie", 7427 


i 3 
GUERRE | — pe m= = = 72N' 
uf! = p — 3p py p(1 — p)(2p — le 7.4.28 


6 
— tp(l — p)(Sp? — 5p + 1e 2". 


If we compare these formulae with 7.4.17, 7.4.18, 7.4.19, 7.4.20, and 7.4.21, 
it will be seen that even for a rather small N suchas N = 10, they give very 
good approximations to the formulae based on the discrete model. 
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The above formulae may also be expressed in terms of the inbreeding 
coefficient, since 
1 
e 2IN'—|— f, 
as shown by 7.3.18. Thus 


0) — 


Hi =P, 7.4.29 
u$ = p — p(1— pX1 — ff), 7.4.30 
y? = p(1— pf, 7.4.31 
u$) = p — ip(1 — P — f) — dp — p)Qp — 0 — f, 74.32 


u$? = p — $p(1— pl — f) — p(1 — p2p — na - fy 
— p(l — PEL — 5p(t — p) — foi. 


From the above process of calculation, it may be inferred that the solution 
of 7.4.23 for an arbitrary n, (n > 2), with the initial condition © = p", must 
have the form 


7.4.33 





uO = p+ "y co» THE 7.4.34 
i=1 
where 
i | d(i 1) T 
XU TY 4.35 


and CÜ"s are constants which do not depend on t. For a large value of t, 
only the first few of these constants are important and they are rather easily 
determined from 7.4.23. For example, substituting 7.4.34 into 7.4.23 and 
comparing the terms involving 4, in both sides of the resulting equation, we 
get 

a), n — 1) (D 

"(n+ (n= 2) ""' 


from which we obtain 








C a) Co 7.4.36 
n+1 
where C! = — p(1 — p) from 7.4.25. General terms are much more difficult 


to derive so we shall only give the result: 


uM = p Y Qi Dp1 — pX-1YF( — & 1 +2, 2, p) 
i=1 7.4.37 
(n — (n — 2): (n — D. eT, 


(n - D(n 4-2): (n+ Do à 
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where F represents the hypergeometric function, that is, 


m. m 
Fü-iie22, p 214 92002 
7.4.38 
4 EOD DS UE RUE) d exu (pes qd 


1x2x2x3 


For i= l, 2, and 3, the right-hand side of the above expression reduces to 
l, 1 — 2p, and 1 — 5p + Sp’, respectively. 

As the process of random drift proceeds, the probability that the gene 
is either fixed in the population or lost increases gradually. It is possible to 
find this probability from the moment formula 7.4.37 by the following device: 
For the probability of fixation, 


1 
f, t) =lim Y x"f(x, t) 2 lim p. 7.4.39 
no x=0 n-* 00 

This makes use of the fact that for n = œ and O € x € 1, x" = 1 only when 
x = 1; otherwise x" = 0. The right side of 7.4.39 is readily evaluated from 
7.4.37. Let us denote the above probability by f(p, 1; t), meaning that this 
is the probability of the gene A, reaching fixation by the tth generation given 
that its initial frequency is p. Thus we have 


Kit 1), 


fp 150 = p+ Qi € Dp — DC-DIFQ — ii 2,2, pe^ 5h 





l 3 
= p— 3p(l — pe 2N' + Sp(1 — py — 2p)e 2N' 7.4.40 
6 
T 7p(1 T py —Spt+ Sp?)e 2N' ru 
This can also be expressed as 
e (-1y i+), 
f(p, 1; t) = P EL Em [P;- ,(r) — Pa (le CAN ja 


where r = | — 2 pand theP(r)'srepresentthe Legendre polynomials: Po(r) = 
1, P(r) =r, P,(r) = 38r? — 1), P3(r) = Xr? — 3r), etc. 

The probability, denoted by f(p, 0; t), of A, being lost or its allele 4, 
being fixed by the tth generation is obtained simply by replacing p with 
l — p and r with —r in the above expressions. Therefore the probability 
that both A, and A, coexist in the population at the tth generation is obtained 
from 


Q, — 1 — f/(p, 1; t) — f(p, 0; t). 
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If we use the relation Pj( —r) = (—1)'P,(r), we get 
oo (2j* 1)(2j* 2)t 
= a Patr) - Prj42(r)le a 
7.4.42 
—31-— pe 27" done 


Thus for t + oo, we have the asymptotic formula 


I 
Q, oe 6p(1 T pe v. 7.4.43 
Note that this is different from the formula for heterozygosity, i.e., 7.3.9. 
A similar method for calculating moments can readily be extended to 
the case of three alleles 4,, 45, and 4,, though the situation is more com- 
plicated (see Kimura, 19555, 1956). 


7.5 The Variance of a Quantitative Character 
Within and Between Subdivided Populations 


When an infinitely large population is subdivided into isolated subgroups of 
finite size N, within each of which mating is at random, the process of random 
genetic drift will go on in each of them until the frequency of a particular 
allele ultimately becomes either 0 or 1. We will designate the subpopulations 
as lines. 

First, consider a character determined by additive genes. Let « be the 
average effect of substituting A, for 45, so that we can express the genotypic 
values of 4, 4,, A142, and A, A as 2a, a, and 0 respectively. If the frequency 
of A, in a particular line is x,, the genotypic mean and the variance within 
the line are respectively 22x, and 2x?x,(1 — x,). Thus the genotypic variance 
within a line at generation t is 


Vi = Qo? Eq (1 — x,)] = 2x* (uf? — 1), 
which reduces to 
VO = 2a? p(1 — pY(1 — f.) ates 


if we apply 7.4.29 and 7.4.30. The variance between line means at the tth 
generation is 


Vi" = Eg{(2ax,)"] — [E4(22%,)]? 
= 4c? [u$ — (ut) ], 
which similarly reduces to 


V(? = 2a? p(1 — p)2f,. 7.5.2 
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The total genotypic variance (see Chapter 4) is obtained from 
VO) = E,[(22)?x2 + a’2x,(1 — x)] — [E,Qax)]? 
= 2a [uj? — uy" + u?]. 
In terms of f,, this becomes 
yo = 2a? p(1 - p(l + Jd. 7.5.3 


This total genotypic variance may also be derived by noting that the fre- 
quencies of 4,4,, 4,45, and A, A, in the total population with inbreeding 
coefficient f are 


= pf + p-f), 
2P,; = 2p(1 — pl — f), Ht 

P4; 2 (1 — pf & (1 -p — f), 
and that the total genotypic variance for such population is given by 

V, = Qay'P,, + à?2P,; — QaP,,  2aP, y, 
and therefore 

V, = 2a? p(1 — pY +f), 
as was discussed earlier (see 3.10.6). 

The total genotypic variance (7.5.3) is of course equal to the sum of the 
preceding two variances Vi" and V{"). Since Vo = 2a? p(1 — p) isthe genotypic 


variance expected when the entire population is a panmictic unit, we may 
write 


VO=V (1 — Sf), 7.5.5 
VP = Vof, 7.5.6 
V? 2 WL + fa). 7.5.7 


These results for additive genes were given by Wright (1951) and were 
shown to hold under quite general conditions (Wright, 1952). They hold not 
only when lines are completely isolated and drifting toward fixation but also 
for cases when a steady state has been reached under mutation, crossbreeding, 
and selection, as long as f, represents the inbreeding coefficient of individuals 
relative to the total population. 

Returning to the case of random genetic drift in completely isolated 
lines, if the lines are started as random samples from a very large parental 
stock for which fo = 0, we have approximately 


VO = Ve -5 7.5.8 
and 


pie vali = ein), 7.5.9 
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To summarize, the variance within lines decreases at the rate of 1/2N 
per generation and finally becomes 0, while the variance between lines 
increases with time and finally becomes 2V,. 

The situation becomes much more complicated if there is dominance. 
Let Y,,, Y;2, and Y;; be respectively the genotypic values of A,A,, A,A2, 
and A, A,. Fora particular line in which the frequency of A, is x,, the mean 
genotypic value is 


M(x) = Yy,x2 + 2Y,; x(1 — x) + Y22(1 — x,)? 
and the genotypic variance within the line is 

V(x) = Y? x? + 2Y12x(1 — x) + Y220 — x)? — M*(x,), 
of which 

2x(1 — x) [(Yi, — Yura. + (Yiz — Yal — x)! 
is the additive or genic component and 

xt (1 — x)? (Yi — 2Y1; + Yo2)? 


is the dominance component (cf. Section 4.1). 
To make the expressions simpler, we will choose a scale such that 


Y, = l, 
Y, = 0. 


The mean genotypic value averaged over the whole population in the tth 
generation is 


M$? = E [M(x] = Egl2hx, + (1 — 2h)x?] 
or 

Mf? = 2hp\? + (1 — 2h)pẸ. 7.5.10 
In terms of the inbreeding coefficient, this becomes 

Mf? = p — (1 — 2h)p(1 — p)(1 — ff). 7.5.11 
This result is also obtained from 

Mj = P, + h2P,2, 


by using 7.5.4. 
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The total genotypic variance is obtained from 

VP = Eyl x? + h?2x,(1 — x,)] — {Epix}  h2x(1 — x,)]}’, 
and this reduces to 

y? = 2h? uP + (1 — 2h)? — [2h + (1 — 259p. 7.5.12 
In terms of /,, it is written as 


Ví? = p(1 — pY[1 — (1 — 2p — 2h? + 4ph)y(1 — f.) 
— (1 — 2h p(1 — py — fj^]. 


It may be noted that this is also obtained from 


7.5.13 


yi = Pii + h?2P,, — [MY]? 


by using 7.5.4 and 7.5.11. 

If we regard the total genotypic mean and variance as functions of the 
inbreeding coefficient rather than functions of t, writing them as M,(/) and 
(V, ), we have 


My) = p — (1 — 20)p( — pX1 — f), 
WU) = pL — p) — p(1 — pX1 — 2p — 2h? + AphY(1 — f) 7.5.14 
— p>(1 — p)*(1 — 2h —f)?. 
Since these two relations can be derived directly from 7.5.4, it may be seen 
that they hold for any distribution as long as p represents the average gene 


frequency and f is defined such that 7.5.4 holds for the whole population. 
The above relations are also written 


MS) = MO) — f) + MO), 
Wf) = VOU — 0) + V(Df + EM) — MIO) f —S), 
as was done earlier (3.10.1 and 3.10.3). 
The genotypic variance within lines is 
VO = E,LV(x,)] 
= 2h? +(1 — 6h?) a? — Ah(1— 20)? 7.516 
- (1 = 2h)*u$, 


7.5.15 


of which the additive component or the genic variance within lines is 
Lem = E,[2x,(1 = x)L[(1 —h)x,+ h(1 — x)Y] 
= 2h?u? + 2h(2 — Sh)uf? + 2(1 — 2h)(1 — 4h)? 7.5.17 


- X01 — 2hy'u$, 
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and the dominance component is 


Vo, = E4(x7( = x)*(1 = 2h)?] 


= (1 — 2h [u? — 249 + wD. cs 
The variance between line means is obtained from 
VP = ELM'()] — {EglM(x)]}, 
which leads to 
Vi? = Ah* uf? + Ah(1 — 2h)u + (1 — 2h)? ui? 
7.5.19 


- [2h + (1 — 289). 


If the random genetic drift proceeds within each of the completely 
isolated lines, the above results are expressed, by applying 7.4.29—7.4.33, in 
terms of the inbreeding coefficient as follows: 


Voy, = 42 —3h + 34^)p(1 — pil — f) 
-(1— 2h)p(1 — p)2p — DO — f. 7.5.20 
+ 4(1 — 2h)? p(1— pY[1 — Spl — PIO — fi)’, 

Ving) = $3 — 2h + 2h)p(1— p) — f) 
+1 — 2h)p(1 — p)Qp — DO — f)? 7521 
4&1 — 2h? p(1 — pE! — 5p(1 — PIO — f). 


Vita) = $(1 — 2h)? p(1 — pX1 — fj) 


— 1(1 — 2h p — pU — Sp(1 — py] — f). 7.5.22 
Vt = p(1 — p) + 1(—9 + 6h + 4h? + 10p — 20hp)p(1 — py — f.) 
— (1 -2hy'p'ü - py — f) 
7.5.23 


- (1 - 2n)p(1 — p)(2p — 1) — f 
— (1 — 2h)’p(1 — p)[1 — Spl — py) — f). 


These calculations are tedious but straightforward. 

One of the most interesting situations arises when the gene A, is com- 
pletely recessive and occurs in the original stock at a very low frequency. 
In this case, the frequency of homozygous-recessive individuals will increase 
with time and genotypic variance within lines increases up to a certain value 
of inbreeding coefficient, as shown by Robertson (1952). 
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For a completely recessive and rare gene (h =0, p z 0), we have the 
following results: 


VO = bp — £4 — 5(1 — f) - (1 — f°, 7.5.24 
Vil = U — AB- SCL — f? + 21 — f°, 7.5.25 
Ving = ip(a — f)[1 — (0 — f°, 7.5.26 
VP -p[1-30—5)*0-—-4)-30—5fX]. 7.5.27 
Vi? pf. 7.5.28 


Figure 7.5.1 shows change of these variances in terms of change in the 
inbreeding coefficient. As will be seen in the figure, the genotypic variance 
within lines first increases with f and reaches its maximum value when f is 
roughly 1/2 and decreases thereafter to become 0 when f reaches unity. Its 
additive component behaves somewhat similarly. Generally, such a pattern 
of change in V, , i.e., increase followed by decrease, occurs when 6p? < 1 or 
p < 0.41. For a higher gene frequency, the variance within lines always 
decreases with increasing f, as in the case of no dominance. For details, see 
Robertson (1952). 





8 
E: 
E: 
Vw (^) 
0 0.5 1.0 


f 


Figure 7.5.1. Change of total genotypic 
variance (V,), variance between lines 
(V5), variance within lines (Vw), and 

its additive component (V.4,) of a char- 
acter governed by very rare recessive 
genes (Robertson, 1952). 
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An essentially different case in which lines are partially isolated and 
random drift toward fixation is counterbalanced by occasional crossing be- 
tween lines so that there is a steady-state gene frequency distribution has 
been worked out by Wright (1952). He has shown that the result for the 
variance within lines, assuming a completely recessive character, does not 
differ very much in terms of f and p from the above case of complete isolation. 


7.6 Effective Population Number 


7.6.1 Introduction 


In the preceding section of this chapter, we assumed an ideal population of N 
breeding individuals which are produced each generation by random union 
of N male and N female gametes regarded as random samples from the 
population of the previous generation. We then studied the change of mean, 
variance, and higher moments in gene frequency due to this random sampling 
of gametes. We also discussed here and in Chapter 3 decrease in heterozy- 
gosity of an individual and increase of genetic homogeneity within a finite 
population. For a given initial gene frequency, these quantities are expressed 
solely in terms of population number N. 

On the other hand, the breeding structure of an actual population is 
likely to be much more complicated and may differ from this ideal population 
in many respects. Thus it is desirable to have formulae through which such 
complicated situations are reduced to the equivalent ideal case which we 
understand and for which we have formulae. A few of the simpler formulae 
were given in Section 3.13 where the idea of an effective population number 
was first mentioned. 

The very useful concept of effective population number was introduced 
by Wright (1931) to meet this need. Jn a finite population, as we have dis- 
cussed, there is a decrease in homozygosity (inbreeding effect) and a random 
drift in gene frequencies because of sampling variance (variance effect). In 
simpler cases a population has the same effective number for either effect, 
and for this reason Wright used them more or less interchangeably. But for 
more complex situations it is necessary to make a distinction (Crow, 1954; 
Crow and Morton, 1955; Kimura and Crow, 1963). Our treatment in this 
section follows the latter paper. 


7.6.2 Inbreeding Effective Number 


We first consider a monoecious diploid population, mating at random and 
including the possibility of self-fertilization. We assume that each individual 
has an equal chance of contributing to the next generation. Then the number, 
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k, of gametes contributed by an individual to the next generation will follow 
a binomial distribution. A particular gamete has a probability 1/N of coming 
from a particular one of the N parents and there are 2N gametes in all. So 
the probability of k gametes coming from a particular parent is 


2N\ (1\* [35075 
Ys ( as x) . 7.6.2.1 
( k J \N N 
In a population of stable size, 


k=2 7.6.2.2 


and the variance is 


Xea(D a -2) «3 - 2). T 


More generally, if the average number of contributed gametes is k, the 
variance of k will be 


pd cd i5 


Under this circumstance, each parent has the same expected number of 
offspring and the probability of two randomly chosen gametes coming from 
the same parent is 1/N. For a large N the frequency distribution will approach 
the Poisson distribution for which 


V, =k, 7.6.2.5 


as discussed in A.5. 

When such an ideal situation is not realized, we define the effective 
population number by the reciprocal of the probability that two randomly 
chosen gametes come from the same parent. To see this point, let us consider 
first a population of monoecious diploids. We assume that mating is at 
random, though the expected number of offspring is not necessarily the same 
for each individual. Let P, be the probability that two uniting gametes (or 
equivalently, under the assumption of random mating, two randomly chosen 
gametes) come from the same individual of the previous generation, t — 1. 
Then, using the same reasoning as we have previously employed in Section 
3.11, we obtain 


f= P(E) ea pyra 


or 


P, P, 
h-yt ( -g-. 7.6.2.6 
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where f, is the inbreeding coefficient in generation t. On the other hand, in 
our ideal population consisting of N,-, individuals in generation t — 1, the 
corresponding relation is 


1 1 
= ——— 1 - —— | fi-a 6.2. 
Á m m^ i id 
The above two expressions agree with each other if 
1 
P= 
New 


The heterozygosity decreases at the rate 


AH,-  Af-, _ Pr 


(cf. 7.3.16). Thus two monoecious populations with equal P, are equivalent 
with respect to the inbreeding effect and we can define the inbreeding effective 
number by the relation: 


Nap = E 7.6.2.8 
Note that the effective number thus defined is determined by the number 
of individuals in the parental generation. From the above definition, we can 
derive a concrete formula for the effective number when the distribution of 
the number of contributed gametes (k) is known. 
Let k; be the number of successful gametes from the ith parent in 
generation t — 1 (/2 l, 2, ..., N,.,). The number of ways in which two 


gametes can be chosen out of the total number of N,_,k gametes is 


Cg] 
5 > 


of which 


2) 


is the number of cases in which two gametes come from the same parent. 
Thus 


MN N, ,K(N, ,k — 1)’ 
2 
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where k is the average number of contributed gametes per parental individual, 
and 


uk 
ESN 


We will define V, by 


X Qu-E* X p 
N, -1 N,-1 i 





V, 7.6.2.10 


the variance of the number of gametes contributed per individual in the 
parent generation. With this definition, 


Yd = N,- (V + R), 


and, if we note that 
N, 4k = 2N,, 7.6.2.11 
the above probability may be expressed as follows: 


VE Kk- 0) 


' . kKQN, - 1) 
Thus the inbreeding effective number is given by 


2N,- 1 


Ns ae eee 
e“) mp4 E 


7.6.2.12 


where N, is the number of individuals in the tth generation. 
For the ideal case in which the distribution of k is binomial with mean 
k and variance 


f 1 
n= vs J( -R 


1 
= Ki -x) 


the inbreeding effective number reduces to the actual number in generation 
t— l; 





Nakai 


1 N,-3, 


Nan = 





t—1 


as it should. 
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When k does not follow the binomial distribution but the population 
number, N, remains constant (k — 2), the formula for the inbreeding effective 
number becomes 


4N —2 


e(f) 7 Vi +2” 7.6.2.13 


as first shown by Wright (19385). If all individuals contribute equally to the 
next generation (V, = 0), the above formula reduces to 


N, 


e 


yy =2N—-1; 7.6.2.14 


the effective size is approximately twice as large as the actual size, as we 
mentioned in Section 3.13. On the other hand, if only one individual con- 
tributes the entire next generation, 


ZS (kN, -D° rot 


2_ p2 - 
Nai k = K(N,, 1), 


V. 


and 7.6.2.12 reduces to 


N,-,k-1 


N SS = d, 
«o RN S dye A 


as it should. 

The assumption that all individuals have an equal expectation of off- 
spring is unlikely to be met in nature and the effective number in natural 
conditions is usually considerably smaller than the actual number as pointed 
out repeatedly by Wright. 

Next we will consider a population with separate sexes, still assuming 
random mating. Here the situation is somewhat complicated because we 
must consider three consecutive generations. We will define P, as the prob- 
ability that two homologous genes in two individuals in generation t came 
from the same individual in the previous generation, t — 1. Then we have 


l T f- 
in = P (2) ea - POL 
or 
P P. 
fxr =z + — Pht > few 


as shown in Section 3.11. 


350 AN INTRODUCTION TO POPULATION GENETICS THEORY 


For an ideal population consisting of N* , males and N,**, females in 
which each individual has the same expected number of offspring as the 
others of the same sex, the probability P, is given by 


P l (s + =, 6.2.15 
— e 7.6.2.1 
"ANNE, NP 
(see 3.11.5). 
If the numbers of males and females are equal, 
N, 
NIA = NI = CE 
and we get 
] 
P, 2 —. 
' N,- 


It has been shown already (see 3.11.9, 3.11.5) that, for a constant number of 
males and females, heterozygosity decreases at a rate of approximately 
1/(2N, + 1) per generation with N, given by 


4N*N** 


Ne= RFN 


This is again the reciprocal of P,, though P, for this case is slightly different 
from the monoecious case. Thus any two populations (with separate sexes) 
having equal P, are equivalent with respect to the inbreeding effect. 

The formula corresponding to 7.6.2.12 may be derived as follows: Let 
k; be the number of gametes contributed by the ith individual in generation 


t — 1. Then 
ki 
(3) 

ae: 

The term — N, in the denominator, which did not appear in the case of 
a monoecious population, comes from the fact that only alleles that did not 
enter into the same individual in generation t can unite to form generation 
t+. 

We now proceed as in 7.6.2.9-7.6.2.12. 

Noting that N, = N,-,k/2, the above expression is simplified and we get 

_\y+hR-k 
: R(N,-,k a 2) : 


P, = 7.6.2.16 
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The effective number is then given by 


1 N,..k—2 
Mun pe — V, 
"o k—lt—c 
k 
Or 
2N, —-2 
Nan = V 7.6.27 
p A 6.2. 
ee ar a 


Note that the numerator is now 2N, — 2 rather than 2N, — l as in the case 
of a monoecious population. The difference, however, is important only when 
N, is very small. 

When the numbers of males and females, M* , and N,**,, are different 
it is sometimes convenient to calculate the mean and variance of k separately 


for each sex and then combine them to get k and V, from 
k = mE* + (1 — m)k**, 

* * o * LE*x42 7.6.2.18 

V, = mV£ + (1 — mViz* + m(1 — myKk* — k**)?, 


where k* and k** refer to the number of gametes from males and females 
and m is the proportion of males (cf. 3.10.3). Thus if kf and k** are the 
numbers of gametes from the ith male and jth female respectively, then 


peo Lik? ps k 


* ^ OF 
t-] NL 





and 
LGt-Eky — Q4 Cie 
k —Ü . 


* 
ie N** 
t—1 





* > 
1-1 


Also, since m and 1 — m represent the proportion of males and females in 
generation t — 1, 
N= Ni 
m=—, l-m= 
t= t= 1 





If the number of gametes contributed per individual follows the binomial 
distribution, 


] 
y*- e(t x ), 
: Ne, 


= 1 
Vit- (1 OTN ) 


ta 
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and we obtain 


Nep) = 4m(l — m)N,.,, 7.6.2.19 
or , 
Ne = EL CINES . 7.6.2.20 
Ni act NU, 


This agrees with our earlier result (3.11.5), first given by Wright (1931). 
On the other hand, if the population consists of a single pair, a male 
and female, 


N,-1 22, V =0, 
and 7.6.2.17 reduces to 
Ney) = 2, 


as it should. 

Notice that the effective number related to an autozygosity increase in 
generation f + 1 is a function of the population number, N,_,, two genera- 
tions earlier. This is as expected, since with separate sexes two homologous 
genes could not come from a common ancestor more recent than a grand- 
parent. 


7.6.3 Variance Effective Number 


We have just developed the concept of effective population number as this 
relates to the change in the probability of identity by descent. We now con- 
sider a definition of effective number that renders different populations com- 
parable as regards the sampling variance of the gene frequency. This we call 
the variance effective number. 

In many cases the two concepts lead to the same consequence, but not 
in general. 

In an ideal population the sampling variance of the gene frequency drift 
from parent to offspring generation is Vj, = p(l — p)/2N. So the natural 
definition of the variance effective number is obtairted by setting the actual 
variance, V;,, equal to p(1 — p)/N,,, where Naw) is the variance effective 
number. 

Consider first a population of monoecious diploids and let NV,_, be the 
number of individuals in generation t — 1. As in the previous calculation, 
we will denote the number of gametes contributed per individual by k and 
define its mean and variance by 


k RP ou 
pz EP 


7.6.3.1 
N,-1 N,-ı 
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where k; is the number of gametes contributed by the ith individual and the 
summation is over all the individuals in generation t — 1 (i = 1, 2, ..., N, y). 

In our retrospective approach of defining the effective number from the 
observed distribution of contributing gametes, the value of k for a specified 
individual is fixed and not a random variable. However, if we pick out, 
conceptually, an individual at random from the population, k is a random 
variable with mean k and variance V,. 


E()-k, E[(k—- ky]. 7.6.3.2 
Since each individual can contribute both male and female gametes, we will 
denote the number of the two types by k* and &** such that for the ith 
individual 

p= kë + kř*. 


Their average is equal but they have their own variances: 


k*- 2 ki = k** = XM _k 7.6.3.3 
1-1 N,-, 2 
ke B kt*? 
Vi = Dx Ake Vee LU — k**?. 7.6.3.4 


t-1 t-1 

We will assume that the population in generation t — 1 contains a pair of 
alleles A, and A, with frequencies p and 1 — p. The number of individuals 
with genotypes 4; A,, A142, and A; A; within the population will be denoted 
by n,,, 7,2, and 75; (Aii + ni; + n; = Ny). 

While all the gametes from an A,A, individual contain allele A,, half 
of those from a heterozygote (44,42) do, so we will designate by /* and /** 
the number of A, gametes among male and female gametes produced by a 
heterozygote. For given values of k* and k**, /* and /** are random variables 
which follow the binomial distribution with means and variances given by 

* ** 
E(I*) = T E(I**) = t 7.6.3.5 


k* 2 k* k**)\ 2 k** 
* 0M zm ee = —.. .6.3. 
e(( pss s[( 24 Um 


Let us first consider a collection of male gametes which are produced by 
individuals of generation t — 1 and which are destined to form generation t. 
The total number of A, genes contained in them may be expressed as 


r,,k* RE Y p 


where £,, and Ej, denote respectively summation over 4,4, and 4,4; 
individuals in generation t — 1. 
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Similarly, the corresponding quantity for female gametes is 
Lak Ea, 


Thus the change of gene frequency between generations t and t — 1 is given 
by 


1 
êp = S E [(EZ,,k* + Z,1*) + (Z,,k** + X,,1**)] — p, 7.6.3.7 
I-I 
from which we have 
N,-ıkôp = (E,,k* + X,;1*) + (X, k** + X,,1**) — N,- kp. 7.6.3.8 
Let us define random variables X* and X** by 
k* — k* k* 
x* = E(t - E) zu SF En." - 5] 7.6.3.9 
and 
** —— [x*x* k** 
X** =E (k** — k**) - Y, 2 + zr — =). 7.6.3.10 
Then 7.6.3.8 may be expressed as 
No nkópeXt4 X**, 7.6.3.11 
which may be derived by noting that 
k* - k** 2 
eae + 2 =) + [5g + X12 =) = N,-ıpk. 


Now, from 7.6.3.2 and 7.6.3.5, we have 
E(X*) = E(X**) = 0. 
Furthermore, assuming independence of male and female gametes, we have 
E(X* X**) = 0. 
Thus, squaring both sides of 7.6.3.1] and taking expectations, we have 
(N,-,K V, = E(X*^)  E(X**?), 7.6.312 
where 
Vs, = E[(Sp)’]. 


In order to evaluate E(.X*?), we note that in the right side of 7.6.3.9 the first 
two terms are independent of the last term, because variation in the number 
of contributed gametes is independent of the variation in the number of 4, 
genes within the gametes produced by heterozygotes. 
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Thus 
* _ L*12 *\ 12 
B(x*?) = £[, (k" -R+ 5 k | «E[zs(re- 5) 


7.6.3.13 





The first term in the right side of this equation may be evaluated by using 
the relation 


Ve 
Nisi — 1 : 
where Ch- is the covariance between the numbers of gametes contributed by 
two different males, that is, 


Dies (kt - Eug — E) 


Ciim 7.6.3.14 


* _ 
C= 


7.6.3.15 
N,-,(N,-, m 1) 
The relation 7.6.3.14 is a direct consequence of the identity 
$ (ki — k*) = 0, 


because, by squaring both sides, we have 
[Xai - RY) P =0 
or 
N,-,V£ + N,-,(N,-, — 1) Cie = 0, 
which is equivalent to 7.6.3.14. 


If we pick at random two individuals from the population, then the 
expected value of the cross product (k? — k*)(k# — k*) is equal to Ciy, ie., 


E[(k? —-k*(kj—-k*)]2-Ch. (ij). 7.6.3.16 
Noting the above and using 7.6.3.14, we obtain 
k* x k* 2 
i. 





E|z.,oe -R*+ E 


n n,(n,,—1 
= (n; «ve + rar —1)+ niin: + teach, 


4 
n n,»\2 n V 
nort es) - QC) 
t-1 
VE n ni5M? 
EESTI [vn «uu = (n; 2j | 7.6.3.17 


The second term in the right side of 7.6.3.13 may easily be evaluated if we 
note that /* — (1/2)k* follows the binomial distribution with mean O and 
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variance k*/4. The distributions are independent between two different indi- 
viduals. Thus 


kyl? k* ni; 
«e = — = — * 
Ez. =| eza) 12 p. 743.18 


Combining 7.6.3.17 and 7.6.3.18, we get 
Vi n nM] n 
E(X") - T 1 Ia (nas + "ej = (n. t c | + Au. 
7.6.3.19 
Similarly, for female gametes we have 
yet n nM] n 
E(X**?) X ED 1 [min + a) z UT + "un | + ra 
7.6.3.20 
Adding these two equations, we obtain 


V, 2 
(N-k) Ve = 3] Lu +) = (n; + ma) | QE 





t~i — 1 4 2 4 
7.6.3.21 
because 
V7, = VE+Ve*, k= k* + k** 7.6.3.22 
Let us define a coefficient «,_, by the relation 
n 
NS =(1 — «-,)p^ + ip, 
t-1 
n; 
NL — (1-2, ,)2p(1 — p), 7.6.3.23 
t—1 
n7; 2 
Ncc (= p)? + a p. 
i- 


The coefficient a,_, is a measure of departure from Hardy-Weinberg pro- 
portions, whether because of inbreeding or other factors. Then the right side 
of 7.6.3.21 is much simplified, giving 


a y a = U + oan), ME p = =) 
"— p 





Z(N,., De 1) 2 
or 
as] ET. TM^ 
lc - p] ® N,,-1k (+a) +U — a-i). 7.6.3.24 
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The variance effective number is defined by 


New) = 7.6.3.25 
since the sampling variance for the ideal situation of N monoecious individ- 
uals mating at random is 


pa — p) p 


Yop = 2N 


From 7.6.3.24 and 7.6.3.25, we obtain 


PCL — p) N,-1k 


New) = 
(v) — UB. 


ža + a,-1;)+(1 — 0,1) 


or 


2N, 
New) c ae ae | 


s? 7.6.3.26 
ru + %-1) t (1— 0-1) 

which is the required formula for the variance effective number. In this 
formula 


aa li? Na, 
Nae WerebU 

which includes the Gaussian correction, N,-;/(N,-; — 1), and N, is the 
number of individuals in generation t, N, = N,_,k/2. 

Notice that, whereas the inbreeding effective number is naturally related 
to the number in the parent (or with separate sexes, the grandparent) genera- 
tion, the variance effective number is related to the number in the progeny 
generation. This is to be expected, since the probability of identity by descent 
depends on the number of ancestors whereas the sampling variance depends 
on the size of the sample, i.e., the number of offspring. 

We now consider some special cases. When k follows the binomial 
distribution 


1 
4 (rx) 





or 
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and 7.6.3.26 reduces to 
New) = N, 
irrespective of the coefficient o,. ,. 
When the population keeps a constant number N, k = 2, and if a,_, = 0, 
7.6.3.26 becomes 
_ 4N 
M Dm 
However, in a finite population under random mating, the expected 
value of «,., is not O but —1/(2N,-, — 1) (cf. 2.10.1). Thus if the parent 
generation were derived from random mating, but a,_, is not known, we 
substitute the above expected value for «,_, in formula 7.6.3.26. This gives 


2N,-1 — Dk 
New = 1 = 
k 
2(1 + s) 


For example, with self-fertilization, N,., = 1, k = 2, V, = 0, and N,,,) be- 
comes unity as expected. 

If all the individuals contribute equally to the next generation (V, = 0) 
and if k = 2, we have 


Nowy = 2N = 1, 7.6.3.29 


7.6.3.27 


7.6.3.28 


namely, the effective number is about twice the actual number. 

When sexes are separate, the formula for the effective number may be 
derived from the following consideration: The gene frequency in the next 
generation is the average of gene frequencies of male and female gametes, i.e., 


p* + pe 


p 2 


Therefore 


1 
Vsp = 4 (Vs, + Vip) 





-i et ee 
4l 2N? INE 


Thus we obtain 


PROPERTIES OF A FINITE POPULATION 359 


or 
4N*NE* 
Now) = N*a NUS 7.6.3.30 
e e 
where 
N,_,k* 
Ni= oe E à 7.6.3.1 


(1 — ar) +L + aft D 


and the expression for N7* is similar. 

If the males and females are equal in number and have equal progeny 
distributions and if «,_, is not known, we obtain, noting that of, =a**, = 
—1/(N,-1 = I), 


(N,-1 =Z 1)k 


V, 
1 br 
TUE 


Ne = 


In the special case of sib mating, N,_, = 2, k = 2, V, = 0, and we get N, = 2 
as expected. 

When the effective number changes from generation to generation, the 
representative effective number over T generations may also be obtained from 
the consideration of variance as we did for inbreeding (Section 3.13). Let 
Ny be the effective number in the tth generation. Then, as shown in 7.3.4, 
the gene frequency variances in two consecutive generations are related by 





yo- (1- vs» 4 p(l — p) 


ND NG, : 7.6.3.32 
Starting with V{° = 0, the variance after T generations is 
T 1 
vV = p(1— pt -— Ir — =a) (T=1,2,...). 7.6.3.33 


If Nec) is the representative effective size, then 


—) =u h-z) 
1- ==— 
(i3 =O ae 


or 





T1 (1 : ) 2X esi - : ra! 
OB AN. dn NS OND 
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Thus for large N{(),’s and a relatively small T, we have 








i ey Nw 


New (a N e(v) 


Al a 


or 


T 


New) = T 1 E 7.6.3.34 
(2 M 


Thus, as for the inbreeding effective number, the representative variance 
effective number is approximately the harmonic mean of the individual 
effective numbers over the whole time as pointed out by Wright (1938a). He 
gives an example in which the breeding population increases in five genera- 
tions in geometric series from 10 to 109 and then returns to 10 and 
repeats the cycle. For this case, the representative effective size turns out to 
be 54, which is much nearer to the minimum number than to the maximum. 

In many natural populations, the breeding number may stay fairly 
constant with small fluctuations around a certain mean. If it changes for- 
tuitously from generation to generation with mean A and standard deviation 
0, , the effective number is given by 


E: 
New = N — Wo 7.6.3.35 


as long as oy is much smaller than Ñ. This is derived as follows: Let ôN be 
the deviation of N from its mean N, such that 


N=N+65N 
and 

E(óN) -0, | E6óNY = o1. 
Since 


pi-p) PU-pP) m 
N aspe 8) 


Vsp = 
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neglecting small terms, we obtain 





1 2 
E(V;p) = PEP p) | + s. 
Thus 
Au N «(, on) o 25 
N. 2E) a2 N(1- 55] = 8-5, 
LED 


as was to be shown. 


7.6.4 Comparison of the Two Effective Numbers 


To make our comparison simpler, we will consider a population of 
monoecious organisms mating at random. From 7.6.2.12 and 7.6.3.28, the 
inbreeding and variance effective numbers are respectively 


N,-,k-1 
Nan = e 7.6.4.1 
T-k-1 
k 
and 
_ (2N,-, — DE 
New = WC 7.6.4.2 


As noted already, the inbreeding effective number is more naturally related 
to the number of the parents, while the variance effective number is related 
to that of the offspring. The former is usually much smaller than the latter 
if a large number of offspring is produced out of a small number of parents. 
In an extreme case of N,_, = l and k + œ, the inbreeding effective number 
becomes unity while the variance effective number is iud On the other 
hand, if each parent produces just one offspring (k = 1, V, = 0) the inbreeding 
effective number becomes infinite but the variance M nto. number stays 
finite and equal to twice the number of offspring. 

However, these are extreme examples. If the population size is constant 
(N,., = N, = N, k = 2) both formulae reduce to 

4N —2 

Neap = New = V +2 7.0.4.3 
with a slight correction if there are separate sexes; the 2 in the numerator is 
replaced by 4. 
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If the population is growing, the inbreeding effective number, being 
related to the earlier generation, is usually less than the variance number. 
If the population is diminishing the reverse is true. Since in the long run a 
population cannot increase or decrease indefinitely (unless it is to become 
extinct), the periods of growth and decline must approximately cancel and 
the two effective numbers are roughly the same. 

Table 7.6.4.1. summarizes the inbreeding and variance effective numbers 
and includes a number of special cases. 

Crow and Morton (1955) have reported experiments where, by appro- 
priate use of gene markers, the progeny from a large mixed group of 
Drosophila parents could be individually identified as to parentage. From 
such experiments the mean and variance of the number of progeny per 
parent could be computed. These data, plus others from various studies in 
the literature (originally done for other purposes), are given in Table 7.6.4.2. 


Table 7.6.4.2. Relation of effective and actual population number in experimental 
and census data, (From Crow and Morton, 1955.) 


Vi 
k V. Kk Ne 
PROAT Kk ADJUSTED N 
Tok =2 
Drosophila females 
Adult progeny 13.9 4.73 1,82 71 
Egg laying (5 day) 37.2 11.45 1.77 72 
Eggs, lifetime 
wild-type, 25° 714 130 1.36 .85 
wild-type, 30? 430 154 1.71 .74 
wild-type, 19° 941 111 1.23 .90 
vestigial 432 102 1.46 .81 
Lymnaea 391 134 1.68 75 
Women 
New South Wales 6.2 2.61 1.52 19 
England 3.5 2.54 1.88 .69 
U.S., born 1839 5.5 1.28 1.10 95 
U.S., born 1866 3.0 1.93 1.63 .76 


Drosophila males 17.9 11.35 3.18 .48 
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The variance-mean ratio V,/k is given first for the actual data, then after 
adjustment to k = 2. Clearly, any population that is not becoming extinct 
or growing explosively has a mean number of progeny per parent of approxi- 
mately 2, and this adjustment was made by assuming that the deaths occur at 
random. This method for making such a correction, as well as a justification 
for it, is given in the paper by Crow and Morton referred to above. 

It is quite striking that the figures for females of species as divergent as 
Drosophila, a snail, and Homo sapiens have values as similar as those in 
the last two columns. This must mean that the effective population number, 
in so far as this is determined by differential fertility, is not greatly different 
from the number of fertile adults. 

The main factor not accounted for in these data is intrabrood corre- 
lation in fate. In insects where a number of eggs are laid in one spot, all or 
none may be destroyed, and in such a situation the effective number could 
be considerably less than the census number. 


7.6.5 An A Priori Approach to Predicting 
Effective Number in Selection Programs 


Under artificial selection, the effective population number may be smaller 
than the actual number, because the expected number of offspring is different 
in different individuals. This problem was taken up by Robertson (1961) who 
derived a different formula for effective number from the following con- 
siderations: 

Suppose there are N males and N females which are sampled from a 
large gene pool. If the expected contribution of the ith pair to the next gene 
pool is K;, the gene frequency of the newly formed gene pool is Y. K; pi/).. K;, 
where p; is the gene frequency in the ith pair. Thus the sampling variance of 
gene frequency for one generation is 


p(l — p) >. K? 
AŞ Ky ' 


because each pair is a sample of 4 genes. By equating this to the ideal case 
of N pairs with variance p(1 — p)/4N, the effective number is given by 





Ky 
N.= > = : 7.6.5.1 
If we assume that K,, K,, ..., Ky form a sample from a distribution with 


mean jt, and variance V, and that each K; is independent, then 
E(Y Kj) = N? uk + NV, 

and 
E(Y. K?) = N(uk + V). 
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Substituting these in 7.6.5.1 we obtain 


N+C? 


oo T4 C? 5 7.6.5.2 


where C is the coefficient of variation of K between mating pairs, i.e., og/Hp. 

Exactly the same result may be obtained by considering N monoecious 
individuals, if K; denotes the expected contribution of the ith individual. 

In contrast to the previous retrospective approach of measuring effective 
number from the observed distribution of the number of offspring, this 
approach predicts the effective number from a knowledge in the variation 
of selective advantage between families. Actually C? in 7.6.5.2 may be equated 
to the variance of the relative selective advantages. With C = 0 the effective 
number reduces to the actual number, as it should. 

Though his formula is based on the consideration of variance, we can 
give it an inbreeding interpretation, since 

1 NK 


N, (Y Ky’ 
is the a priori probability that two gametes from same parents (in the case 
of separate sexes) or from a same parent (in the case of a monoecious 
organism) will enter into a specified pair of offspring. 

Robertson's idea was used by Nei and Murata (1966) to derive a formula 
for effective population size when fertility is inherited. In a population of 
stable size and with random mating, their formula becomes 


4N 


eee a 65. 
C x 38V, + 2 TuS 


where A? is the heritability of progeny number. For example, if V, = 3 and 
k? = 0.3, N, = 0.52N. 

In general the effective size decreases as the heritability of fitness 
increases. 


7.7 Problems 


l. The effective number of alleles, as used in 7.2.5, is always less than or 
equal to the actual number. Suppose that the frequencies of alleles 
A,, A5, A3, As... are 4, 4, 4, 15,- ... In an infinite population the number 
of alleles is infinite, of course. What is the effective number? (Answer: 3) 

2. Compare 7.3.9 and 7.4.43. Suggest an intuitive reason why these have 
the same final rate of decrease. A glance ahead to Figure 8.4.1 may be 
helpful. 
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. In a group of subpopulations, each of effective size N and starting with the 


same initial frequency, what length of time is required for the gene frequen- 
cy variance between populations to go half way to its equilibrium value 
(see 7.3.4)? 


. Prove equations 7.6.2.18. 
. Explain why « appears in the variance effective number formula (7.6.3.26) 


but not in the inbreeding effective number formula (7.6.2.17). 


. How would the smallness of a population affect the rate of approach to 


linkage equilibrium; i.e., would it enhance the rate, retard it, or have no 
systematic effect ? 


. What is the effective number of a herd of cattle with one bull and 40 cows? 
. What is the effective number of a population of 100 males and 100 females 


and each parent has exactly two offspring? 


. An island population contains exactly 100 persons, equally divided into 


males and females. The initial frequency of a recessive gene is p. What is 
the probability of homozygosity for this gene 10 generations later? If in the 
10th generation two cousins marry, what is the probability that their child 
will be homozygous for this gene? 


. Haldane (1936) showed that in a sib-mated line the equilibrium probabil- 


ity ofalocus being heterozygous for a neutral gene is 12 times the mutation 
rate. Verify this, using the methods of equations 7.2.9 and following. 
Show also that for a self-fertilized population the value is 4 times the 
mutation rate. 


. Show that in a randomly mating population with each parent having an 


equal expectation of progeny about half of the random gene frequency 
drift is caused by Mendelian segregation from heterozygotes and half 
from unequal numbers of progeny. 


o 
STOCHASTIC 
PROCESSES 
IN 

IHE 

CHANGE 

OF 

GENE 
FREQUENCIES 


l ] The various factors which change gene frequencies in natural populations 








may conveniently be classified into three groups. 

The first consists of factors which cause directed changes, such as 
mutation and migration occurring at constant rates, and selection of a con- 
stant intensity. Wright (1949) called these systematic evolutionary pressures. 
The second group consists of factors which produce random fluctuations in 
gene frequencies, of which two different types may be recognized. One type 
is the random sampling of gametes in a finite population. This becomes 
important, as we have emphasized earlier, when the population is small. 
Even in an infinite population the fate of mutant genes depends very much on 
chance as long as such genes occur in small numbers. This will be discussed 
in more detail in Section 8.8. The other type is random fluctuation in system- 
atic pressures, of which fluctuation in selection intensity is especially 
important. 


367 
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There is a third group of factors consisting of events, such as chromosome 
rearrangements, duplication or deficiency of a nucleotide sequence, or poly- 
ploidy, that may occur only once in the history of a species. These are hardly 
amenable to any effective quantitative treatment. 

The purpose of this chapter is to discuss the first two factors, systematic 
and random (or deterministic and stochastic), as they act to change the 
genetic makeup of the population. 


8.1 The Rate of Evolution by Mutation and Random Drift 


In Chapter 7, Section 7.2, we considered the amount of polymorphism and 
heterozygosity that is maintained in a population of finite size by recurrent 
mutation. It was assumed that the mutations were completely neutral selec- 
tively and that many allelic states are possible. We showed that if the mutation 
rate is larger than the reciprocal of the effective population number the 
population is polymorphic and the typical individual in the population will 
be heterozygous for this locus. More specifically, the majority of population 
will be homozygous for the locus if 4N,u < 1 and heterozygous if 4N,u > 1. 
The actual distribution of the number of alleles maintained in this way is more 
difficult and will be discussed in Chapter 9, along with the complicating 
effects of selection. 

A similar situation arises when we look at the process in time. A newly 
arisen mutation is very likely to be lost from the population within a few 
generations because of accidents of the Mendelian process and variations in 
the number of progeny from different individuals in the population. On the 
other hand, if the population is finite, a minority of mutants may be lucky 
enough to persist in the population and ultimately become the prevailing 
type. To be sure, the likelihood of this is extremely small in a population of 
moderate size; but the probability is not zero and in the long time of evolu- 
tionary history events with small probabilities do occur. 

We shall consider now the simple problem of how frequently such 
neutral-gene replacement is expected to occur. Later in the chapter (Section 
8.8 and 8.9) we deal with the process in more detail, including the effects of 
selection. 

Consider a population of size N (actual number, not effective number). 
If we look sufficiently long into the future the population of genes at a par- 
ticular locus will all be descended froma single allele in the present generation. 
In the vocabulary of Chapter 3, they all will be identical by descent and the 
population will be autozygous for this locus. This is the result of the inexor- 
able process of random gene frequency drift. If, in the present generation, an 
allele A, exists in frequency p, the probability is simply p that the lucky allele 
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from which the whole population of genes is descended is A, rather than 
some other allele. 

Now, if mutation occurs at a rate u per gene per generation, then the 
number of new mutants at this locus in the present generation is 2Nu. The 
probability that a particular gene will eventually be fixed in the population 
is 1/2N. So the probability of a mutant gene arising in this generation and 
eventually being incorporated into the population is 2Nu(1/2N) =u. We 
have the remarkably simple result that 


Rate of neutral gene substitution = u. 8.1.1 


That is to say that, viewed over a long time period, the rate of evolution 
by fixation of neutral mutants is equal to the mutation rate. Stated in another 
way, the average interval between the occurrence of successful mutants is 1/u. 

The observed rate of evolution of amino acids in mammalian hemo- 
globins is about one replacement per codon per 10? years. This could be 
accounted for entirely by neutral substitutions if the mutation rate to such 
alleles were 107° per codon per year. This is not to assert that this is neces- 
sarily the major mechanism by which amino acids evolve, but we would not 
be surprised if it turns out that an appreciable fraction of nucleotide replace- 
ment in evolution is of this type. For a discussion, see Kimura (1968), King 
and Jukes (1969), and Crow (1969). 

We have not considered the time required for a successful mutant to go 
from a single representative to complete fixation. Later, in Section 8.9, it will 
be shown that the average time required for those mutants which are successful 
to change frequency from 1/2N to 1 is 


i —4N,. 8.1.2 


This does not depend on the mutation rate. This is as expected; since only 
one representative will be fixed, the time required does not depend on how 
many mutants there are. Note also that the relevent population number this 
time is the effective number, not the actual number. 

These points are illustrated in Figure 8.1.1. The rate of gene substitution, 
when we consider a time period that is long with respect to the time required 
for a single substitution to occur, is u. This is given by the reciprocal of the 
time interval between the occurrence of successive successful mutants, as 
shown on the graph. The time required for a particular gene to be substituted 
is f, also shown. 

If ? is small relative to l/u, that is, if 4N,u <1, then the population is 
monomorphic most of the time, as illustrated by situation b in the figure. 
On the other hand, if f is comparable to l/u as might be the case in a large 
population with a high rate of mutation to neutral alleles, the population 
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Figure 8.1.1. Gene substitution by random drift and mutation. The abscissa 
is time over a very long period; the ordinate is the number of mutant genes 
descended from a single mutant. The upper figure (a) is intended to 

illustrate that, while most mutants persist a few generations and then are 
lost, an occasional one increases to eventual fixation. The second drawing (b) 
shows only the mutants that eventually become incorporated into the 
population. The time scale is therefore much longer. The third (c) shows a 
situation in a larger population or one with a higher rate of occurrence of 
neutral mutants. In this case the time required for a replacement (7) is 
comparable to that between such events (l/u), with the result that there 

is considerable transient polymorphism. 


will have considerable transient polymorphism. At any one time it is likely 
to have more than one allele, although these will be different alleles at 
different times. This is illustrated by situation c. 

Later in the chapter (Section 8.8), it will be shown that the probability 
of fixation of a single gene that is slightly favorable is approximately 25. So, 
even favorable mutants are lost most of the time. However, if they occur 
with any appreciable frequency they are substituted considerably more 
rapidly than neutral alleles, as expected. 
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8.2 Change of Gene Frequencies as a Stochastic Process 


In many mathematical problems arising in population genetics, the process 
of change in gene frequency may be treated as deterministic, as we have 
done in Chapters 1 through 6. This approach was extensively developed by 
Haldane (1924 and later), especially for single-locus problems. It is still 
useful. There are many circumstances where such an approach is sufficiently 
realistic to yield interesting and reliable information, as we already illustrated. 

Yet, when we consider that actual populations are all finite, that many 
mutant genes may be represented only once at the moment of their occurrence, 
and that organic evolution has proceeded over an enormous period of time 
in an ever-fluctuating environment, we realize the necessity of an approach 
that can take indeterminacy into account. 

In this chapter we treat the process of change in gene frequencies as a 
stochastic process. By this we mean a mathematical formulation of chance 
events in a process that proceeds with time. 

The pioneering work in this field has been done by Fisher (1922, 1930) 
and Wright (1931 and later). These authors have been mainly concerned with 
the state of statistical equilibrium that is reached when the form of the dis- 
tribution becomes constant. The problem of constructing the entire history 
of change in gene frequencies starting from arbitrary initial frequencies is 
more complicated. Several practically important cases have been solved by 
one of the authors and he has reviewed the historical development of the 
subject elsewhere (Kimura, 1964). 

A mathematical approach which has proven to be very powerful makes 
use of “ diffusion" models, in which two diffusion equations, the Kolmogorov 
forward and backward equations, play a central role. 


8.3 The Diffusion Equation Method 


In population genetics the fundamental quantity used for describing the 
genetic composition of a Mendelian population is gene frequency rather than 
genotype frequency. The main reason for this, as was discussed in Section 
2.1, is that each gene is a self-reproducing entity and its frequency changes 
almost continuously with time as long as the population is reasonably large. 
On the other hand, genotypes are produced anew in each generation by 
recombination of genes and therefore do not have the continuity that genes 
have. 

Also, we note that in actual evolution the gene frequency changes are 
typically very slow. To be sure, there are some exceptions. One is the rapid 
increase in the melanic gene in some Lepidoptera in industrial areas. Another 
is the development of resistance to insecticides and antibiotics. But, as 
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pointed out by Haldane (19494), the typical rate of evolution shown by fossil 
records is of the order of one-tenth of a darwin, where one darwin represents 
a change by a factor e in a million years, or 1076 per year. On the ordinary 
scales by which we consider time, this is exceedingly slow. 

Therefore, in the following treatment we will regard the process of change 
of gene frequency as a continuous stochastic process. Roughly speaking, this 
means that as the time interval becomes smaller, the amount of change in 
gene frequency during that interval is expected to be smaller. More strictly, 
the process is called a continuous stochastic process if for any given positive 
value e, however small, the probability that the change in gene frequency x 
during time interval (t, t + ôt) exceeds & is o(ót), that is, an infinitesimal of 
higher order than ôt. Furthermore, we will assume that the process of change 
in gene frequencies is Markovian, that is, the probability distribution of gene 
frequencies at a given time t, depends on the gene frequencies at a preceding 
time tọ but not on the previous history which has led to the gene frequencies 
at to, where tọ «€ t. 

The fundamental equations used to study this continuous Markov proc- 
ess are the Kolmogorov forward and backward equations (Kolmogorov, 
1931). The forward equation is called the Fokker-Planck equation by 
physicists and it was introduced into the field of population genetics by 
Wright (1945), while the backward equation was first used in this field to 
study the problem of gene fixation by Kimura (1957; see also 1962). 

Before going into the details of their derivation and use, we will present 
these two equations in the form that may be most easily applied to popu- 
lation genetics. 

Let us assume that a pair of alleles 4, and 4; are segregating in a popu- 
lation. We will denote by @(p, x; t) the probability density that the frequency 
of A, lies between x and x + dx at time t, given that it is p at time t =Q. 
Then, as will be shown later, ó(p, x; t) satisfies the following Kolmogorov 
forward equation: 


dpp, x; t) 


120? 
|^ Qt | 20x 


(Vs. (P, x; t)) 


2 8.3.1 
= ôx {M5 O(P, xX; t)}. 


The time parameter t in the above equation is a continuous variable and can 
be measured in any unit, but for practical purposes, it may most conveniently 
be measured here with the length of one generation as the unit. In this equa- 
tion, gene frequency x at the tth generation is a random variable but the 
initial frequency p is fixed. That is to say, we consider the process of change 
in the forward direction. Also M;, and V;, are respectively the mean and the 
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variance of óx, the amount of change in gene frequency per generation. 
Both M;, and V;, may depend on x and t. 

Next, let us consider the process of change in the reverse direction and 
assume the gene frequency x in the tth generation is fixed and the initial 
frequency p is a random variable. That is, we consider the process retrospec- 
tively. Then, as will be shown later, $ satisfies the following Kolmogorov 
backward equation: 


O¢(p,x;t) 1, 2 
PAS D 75 Von 3i 90 x; t) 


Ó 
tM, ap o(p, x; t). 


Actually, this is a restricted form of Kolmogorov backward equation that is 
applicable to the time homogeneous case. It is valid when both M;, and V, 
do not depend on the time parameter t. In many problems arising in popu- 
lation genetics, both M,, and V;,, though depending on the gene frequency x, 
do not depend on t, so 8.3.2 has a wide use. 

Returning to the forward equation, we will investigate how this may be 
derived. In this equation, p is fixed, so we will omit this letter and write 
d(x; t) for d(p, x; t). 

In a population of N diploid individuals, gene frequency changes 
from 0 to | in steps of size 1/2N, but in the present formulation, x is regarded 
as a continuous variable that changes smoothly from 0 to 1. Thus, ¢(x; t) 
gives the probability distribution at time t such that, when 1/2Nissubstituted 
for dx, (x, dx gives an approximation to the probability that the gene 
frequency is x at the tth generation. We can also interpret $(x; t)dx as the 
relative frequency or proportion of populations with gene frequency x among 
a hypothetical aggregate consisting of an infinite number of populations 
satisfying the same condition. Thus, if we denote the frequency of such a 
class by f(x, t), we have 


I(x, t) = b(x; t)dx, 8.3.3 


in which 1/2N may be substituted for dx. Here we should note that approxi- 
mation 8.3.3 as well as equation 8.3.1 is valid for the frequencies of unfixed 
classes; that is, for 0 < x < 1. As to frequencies of fixed classes, /(0, t) and 
f(1, t), separate treatments are required as will be discussed later. 

Several methods of derivation of the forward equation, differing in 
mathematical rigor and sophistication, are available. We would like to 
present first a very elementary derivation based on the geometrical inter- 
pretation of the process of change in gene frequency (Kimura, 1955). Though 
unsatisfactory from the pure mathematical standpoint, it has the advantage 
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of being quite elementary and at the same time helps us see the physical 
meaning of the terms involved. 

Let y = (x, t) represent the curve for the probability distribution of the 
gene frequency at time t. As shown in Figure 8.3.1, the distribution is approxi- 


y = (x. 0 






h 
x S h x xth 

Figure 8.3.1. Diagram to show the 
meaning of terms in the Kolmogorov 
forward (Fokker-PJanck) equation as 
applied to population genetics. (From 
Kimura, 1955). 


mated by histograms, each column having width A. We represent the gene 
frequency of each class by its middle point. Consider the class with gene 
frequency x. For sufficiently small 4, the area of the column (x, Hh gives 
the probability that the population has gene frequency in the interval 
x — lh to x+ th. After a time interval of length ôt, the population with 
gene frequency x will move to another class owing to systematic as well as 
random changes. However, because of the assumption of a continuous 
stochastic process, we make this movement sufficiently small by taking ót 
small that consideration of the two adjacent classes is sufficient. 

Let m(x, t)ót be the probability that the population moves to the higher 
class (x + h) by systematic pressure. Let v(x, t)ót be the probability that it 
moves outside the class by random fluctuation, half the time to the left class 
(x — h) and the other half to the right class (x + A). Any asymmetrical dis- 
placement may be included in the term m(x, t). Also, greater displacements 
than mentioned above should have an infinitesimal probability of higher 
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order (o(ót)) and can be neglected. Thus the probability that the population 
will have gene frequency in the range x — Mi to x + 4h after time ôt is 
obtained by considering the exchange of gene frequencies between these 
adjacent classes. 


P(x, t + Nh = (x, Hh — {v(x, t) + m(x, t)) dtd(x, Dh 
+ tu(x — A, t)dtd(x — h, t)h + 4u(x + h, t)ótó(x + h, Oh 
T m(x — h, t)óti(x — h, th. 8.3.4 
The second term in the right-hand side of the above equation is the 
amount of loss resulting from movement to other classes, the third term is 
the contribution from the left class, and the fourth from the right class, both 
due to random change; the final term is the contribution from the left class 
owing to systematic change. 
If we denote by V(x, t)ót the variance of the change in x per ôt due to 
random change, 
V(x, t)ót = h?40(x, t)ót + (— h) (x, t)ót 
or 
V(x, t) = hix, t). 8.3.5 
Similarly, if M(x, t)ót is the mean change in x per ôt, 


M(x, t)ót 2 hm(x, t)ót 
Or 
M(x, t) = hm(x, t). 8.3.6 


Substituting 8.3.5 and 8.3.6 in the right side of 8.3.4 and dividing both sides 
by hôt, we have, after some rearrangement, 


$lx, t + dt) — d(x, t) 
ot 


Volx +h, th -Vé(x,t) Vé(x,t) — Vé(x— h,t) 
(d h p h 
EP! h 





E M(x, t) - Mé(x — h, t) 
h y 


where terms such as V (x, t) and M@(x, t) are abbreviations of V(x, t)d(x, t) 
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and M(x, t)ó(x, t) respectively. Taking the limits ðt —^ 0, h — 0, we obtain 


a(x, 12 
oe p 5a (VO DG, 0) 





ð 
- 3 (MG. és D), 


which is equivalent to 8.3.1, V(x, t) and M(x, t) in this equation corresponding 
respectively to V;, and M;, in equation 8.3.1. We may note here that these 
two sets of quantities are defined in a slightly different way. Namely, in the 
above derivation V(x, t) represents the variance per infinitesimal time of the 
random component of change for which the mean is 0. Any systematic com- 
ponent of change is included in M(x, t). On the other hand, V;, and M,, in 
equation 8.3.1 represent the variance and the mean of the change of gene 
frequency per generation. In practice, quantities such as mutation rates, rate 
of migration, intensity of selection, and effect of random sampling of gametes 
which determine the rate of change in gene frequency are all measured or 
expressed with one generation as the time unit. So, for practical purposes, 
expressions V;. and M,, might be more convenient than V(x, t) and M(x, t). 

In the above derivation leading to 8.3.7, we assumed that the systematic 
pressure pushes the gene frequency toward the right, but no essential change 
is required for the argument if it pushes toward the left, in which case we 
simply use m(x + h, t)ótó(x + h, t)h as the last term of 8.3.4. From the 
above derivation it is evident that the first and the second terms in the right- 
hand side of 8.3.7, and therefore 8.3.1, give respectively the rates of change 
due to random fluctuation and systematice pressures. 

We will now attempt a mathematically more satisfactory but less 
intuitive derivation of the Kolmogorov forward equation (8.3.7). 

Let us denote by g(x, £; t, ôt) the probability density that the gene fre- 


quency changes from x to x + € during time interval (t, t + ôt). Then we 
have 


$(x; t + ôt) = [ec — č; 0)g(x — č, 6; t, dt) dé, 8.3.8 


where the integral is over all possible values of č. The above relation follows 
directly from the assumption that the process is Markovian. That is, the 
probability that the gene frequency is x at time t + ĝt is equal to the sum of 
probabilities of cases in which the gene frequency is x — € at time t and 
then the gene frequency increases by € during the subsequent time interval 
of length ót. In this expression, if we take ót small, then, because of the 
assumption of a continuous stochastic process, the change č during this time 
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interval is practically restricted to a very small value. Thus, expanding the 
integrand in terms of £, we have 











Q 2 e 
p(x; t + dt) = ffs -« dau) +$ p 
E i 9 69) | 8.3.9 
3! ax + dé, 


where $ and g stand for $(x; t) and g(x, č; t, ôt). In the following treatment 
we will assume that the order between summation, integration, and dif- 
ferentiation may be interchanged freely. From 8.3.9, neglecting terms involv- 
ing &? and higher powers of č, we have 


éG 190 7 o [aat - Z [o fe ade] 


8.3.10 


Noting that 


fod =l, 


transferring the first term in the right-hand side of 8.3.10 to the left, and 
then dividing both sides by ôt, we get 


$ó(x; t + dt) — ó(x, t) 
ót 


z 9 "s t) 1 - | ége, £5; t, dt) ač) 8.3.11 


3a [00s 03; [as 65 an ae} 
At the limit as ôt + 0, if we define M(x, t) and V(x, t) by 


M(x, t) — lim = [g(x, č; t, 51) d£ "T 
aro Ot 
and 


Vos elim Í g(x, Č; t, ôt) dé, 8.3.13 
310 Ôt 


equation 8.3.11 yields the Kolmogorov forward equation given before as 
8.3.7. The above derivation may still be unsatisfactory from the standpoint 
of mathematical rigor. For more rigorous derivations, readers may refer to 
the mathematical literature, for example, Kolmogorov (1931). 
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Going back to equation 8.3.1 and substituting $(x,t) for $(p, x; t), 
the Kolmogorov forward equation may most conveniently be expressed for 
our purpose as 


Q(x, 12 
a 2 a 2 ax? (Vox p(x, t)} 





2 8.3.14 
- = {Mae ds 0). 


In applying this equation to population genetics, it is often very useful 
to keep in mind that the quantity 


1 ô 
ET (V3. Q(x, t)} + Ma P(X, t), 
x 


which we will denote by P(x, t) and which enters the right-hand side of the 
above equation as — CP(x, t)/Cx, represents the rate per generation of net 
flow of probability mass across the point x. With the help of Figure 8.3.1 
we will again try to show this using a geometrical interpretation. 

The net flow of probability mass across the point x + A/2 during the 
short time interval (t, t + ôt), which we denote by P(x + 4h, t)ót, is given by 


P(x + 4A, t)ôt = m(x, r)ótó(x, Hh 
+ bx, t)óti(x, Hh 8.3.15 
— to(x + h, t)óti(x + A, th. 


Here we consider only the exchange of frequencies between the classes having 
gene frequency x and x + h. Substituting m(x, t)h = M(x, t) and wx, t)h = 
V(x, D/A in the above equation, we get 


P(x + th, t) = M(x, t)ó(x, 1) 
1 V(x +h, t)b(x + h, t) — V(x, NOb(x, t) 
-2 h i 
which gives 
P(x, t) = M(x, t)ġ(x, t) 


18 8.3.16 
= 2 ax {V(x, t)ó(x, t)} 


at the limit of 4-0. Again, for practical purposes, it is convenient to use 
the mean and the variance of the gene frequency change per generation, 
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that is, M4, and V4, for M(x, t) and V(x, t), to give 
i 
P(x,t) 2 -c ax Vs (x, 0) + My (x, t). 8.3.17 


With this expression for the probability flux, the forward equation may 
be written as 


Ap(x,t) — OP(x,t) 


Or = — Ox 8.3.18 


As pointed out following equation 8.3.3, our fundamental equation 8.3.1 
is valid only for gene frequencies in the interval 0 < x < 1 (unfixed classes). 
Therefore, separate treatments are required to obtain probabilities for x = 0 
and x — 1 (terminal classes). In order to obtain the rate of change in the 
frequencies of these terminal classes, we make use of the fact just established, 
that is, P(x, t) in 8.3.17 gives the probability flux across the point x at time f. 
Here we will consider a special but important case in which the change of 
these frequencies is entirely due to inflow of probability mass from the 
unfixed classes. In the terminology of the mathematical theory of probability, 
the boundaries (x = 0 and x = 1) act as absorbing barriers. In such a case, 
we have 





OD = — P(0, t), 8.3.19 
df(1,t) _ 
wo Pd, t), 8.3.20 


where /'(0, t) and f(1, t) are the frequencies of classes having gene frequency 0 
and 1! at the rth generation. 

In the particularly important case in which the random fluctuation is 
solely due to random sampling of gametes such that Vj, = x(1 — x)/2N,, 
and the systematic pressure is solely due to selection such that M,, = 
x(l — x)s(x, t), where s(x, t) is the selection coefficient, 





12 — 
— P(0, t) zm 5 ox i (x, n| — x(1 — x)s(x, t)ġ(x, 9 





Thus, from 8.3.19, we have 


E Len 0L B 


8.3.21 
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In the right-hand side of the above equation, $(0, t) is approximately equal 
to Q(1/2N, t). Since $(1/2N, t) (1/2N) is our approximation for the frequency 
of subterminal class f/(1/2N, t), we have 


ZU UNE x ) ( al ) 2322 





e 


That is, the contribution from the unfixed classes to the rate of change in 
the terminal class with x = 0 is half the frequency of the subterminal class 
with x = 1/2N multiplied by the ratio N/N,. In a special case in which the 
actual size of the population is equal to its effective size, this ratio reduces 
to unity. 

In a like manner, we have 


S t) -340 t) AN (=) x ; s(1 = oy r) (s) 8.3.23 


for the terminal class with x = | under a similar condition. 

The diffusion equation method can readily be extended to treat the 
cases of two or more random variables. For example, for two independently 
segregating loci, each with a pair of alleles A, and A, in the first locus and 
B, and B, in the second, the corresponding Kolmogorov forward equation 
becomes 


0b 12 1 2 ô ð 
dr Dane VP) + 333 Va) — 5: (44,9) 7 5; 045 9» EM 





where $ = jd(p, q; x, y; t) stands for the probability density that the fre- 
quencies of A, and B, are x and y at the tth generation, given that their 
frequencies are p and q at t = 0. 

Some of the applications of the forward equation to more concrete 
problems of population genetics, such as constructing the process of random 
genetic drift owing to small population number, will be presented in following 
sections. 

Next, we will attempt to derive the Kolmogorov backward equation 
given as 8.3.2, in which we consider x fixed and p a variable. In this formula- 
lation we reverse the time sequence and look at the process retrospectively. 
Also, in order to make our treatment simpler, we will restrict our considera- 
tion to the cases in which the process is time homogeneous; that is, we consider 
only those cases in which if x,, and x,, are frequencies of a gene at times !, and 
t,, then the probability distribution of x,,, given x,,, which in general is a 
function of t, and ¢,, depends only on the time difference t; — t,. 
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For such a time-homogeneous Markov process, we have 


$(p, x; t + dt) = Jaw. č; 0t)h(p + 6, x; t) dC. 8.3.25 


The above relation, which is analogous to relation 8.3.8, may be derived by 
considerations similar to those by which 8.3.8 was derived. Note here, how- 
ever, that g in the above relation depends on three variables, p, č, and ôt, 
but not on t. This is due to the assumption of time homogeneity, that is, the 
probability that the gene frequency change from p to p+é€ during time 
interval of length ót is the same for any t (generation). 

Expanding (p + £, x; t) inside the integral in terms of € but neglecting 
terms involving č? and higher powers of č, we obtain, at the limit of ôt — 0, 
the following equation: 


0j(p,x;t) V(p)0*Q(p, x; t) 


at 2 op? 
REN t) 8.3.26 
xi 
p 
where 
"E. 
M(p) — lim — | Eg(p, č; ôt) d£, 8.3.27 
dt-+0 ót 
NET 
V(p) ^ lim — f E2g(p, E; dt) dé. 8.3.28 
t0 ot 


Thus, substituting the mean and the variance of the amount of change per 
generation, Msp and V;, for M(p) and V(p), in the above equation, we obtain 
8.3.2, the Kolmogorov backward equation as applied to population genetics. 

One of the very important uses of this equation is its application to the 
problem of gene fixation. If we take x = 1, $ in the backward equation may 
be interpreted as the probability that the gene becomes fixed (established) 
in the population by the tth generation, given that it is p at the start. We will 
denote this probability by u(p, t), for which we have 


du(p, t) = Vsp d°u(p, t) a du(p, t) 
ô | 2 op ED 


The probability of fixation will then be obtained by solving the above equa- 
tion with the boundary conditions 


u(0, t) = 0, u(l, 4) =1; 8.3.30 
that is, the probability is 0 if p = 0 and is 1 if p = 1. 








. 8.3.29 
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Of special interest in population genetics is the ultimate probability of 
gene fixation defined by 


u(p) = lim u(p, t), 8.3.31 
(70 


for which 


OH e: 
et 


Thus, u(p) satisfies the ordinary differential equation 
Vss d'u(p) du(p) _ 


2 dp? E dp 








0 8.3.32 


with boundary conditions 
u(0) = 0, u(1) = 1. 8.3.33 


The problem of gene fixation is important in the theory of evolution and 
also for the study of breeding. We will elaborate the application of equation 
8.3.32 to this problem later in this chapter. In the next few sections, we will 
apply the Kolmogorov forward equation to solve some concrete problems 
arising in population genetics. 


8.4 The Process of Random Genetic Drift 
Due to Random Sampling of Gametes 


We will first consider the simplest situation in which a pair of alleles A, 
and A, are segregating with respective frequencies x and 1 — x in a random- 
mating population of N monoecious individuals and the only factor which 
causes gene frequency change is the random sampling of gametes in repro- 
duction. As time goes on, the gene frequencies tend to deviate from their 
initial values and eventually one of the two alleles becomes fixed in the 
population. This is the simplest important stochastic process in the change of 
gene frequencies in a Mendelian population. Since Wright's work in 1931, the 
process has been known by the term ‘drift? or more adequately ‘random 
genetic drift’. 

In the previous chapter. we studied the law of change in the mean, 
variance, and the higher moments. The mean and the variance of the change 
in gene frequency x per generation are, respectively, 


My = 0 
and 


» x(1 — x) 


Va. 2 N 
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as shown in 7.3.1 and 7.3.2. For simplicity, we assume in this section that the 
effective population number N, is equal to N. Thus the forward equation 
8.3.1 becomes 


ð 1 e? 
f= ael-  O<x<0), as 


where $ = $(p, x; t) is the probability density that the gene frequency be- 
comes x in the tth generation, given that it is p at t = 0. In terms of the Dirac 
delta function ó(-), the initial condition may be expressed in the form 


$(p, x; 0) = d(x — p). 8.4.2 


The required solution of 8.4.1 that satisfies the initial condition 8.4.2 was 
first obtained by Kimura (1955a). It is expressed in terms of the hyper- 
geometric function as follows: 


$P x; t) = Y p(1 — pyi(i 4- DQi 4 DF( — i, i + 2, 2, p) 
i=1 


8.4.3 
~i(i+1)t 
x Fi -i,i+2,2,x)e 4N , 
where F(', +, *, +) stands for the hypergeometric function so that 
a (1 — ii + 2) 
At Pa aa eg 
8.4.4 
(1 — (2 — D + 2) + 3) p 
is 1x2x2x3 2 


The above solution, 8.4.3, may also be expressed in terms of the Gegenbauer 
polynomial (see Korn and Korn, 1968), defined by 


1— 
T}_(z)= -— Dp (i 2.1 -intgl 8.4.5 
as follows: 
o (2i+ 1) —r? liti» 
b(p, x; t) = 2 en Tj-(r)Tj-\(z)e *" , 8.4.6 


where r—-1- 2p, z=1—2x, and Ti(z)=1, T}(z)=3z, TH(z)= 
(3/2)(5z? — 1), T3(z) = (5/2)(7z? — 3z), Ti(z) = (15/8)(2124 — 1427 + 1), etc. 
The right-hand side of equation 8.4.3 or 8.4.6 is an infinite series, but 
for a large value of t, only the first few terms are of any significance in deter- 
mining the actual form of the distribution. Thus, for a large t, we have 


O(p,x; t) = 6p(1 — p)e^zt -30p(1—p)1-2p)1-2x)e3N: +. 847 
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In particular, at the limit of t > oo, we obtain the asymptotic formula 


1 
-—1 
od ~ Ce 2N : 8.4.8 


in which C is a constant. This means that after a large number of generations, 
the probability distribution for unfixed classes (0 < x < 1) becomes flat and 
decays at the rate of 1/(2N) per generation. This is called the state of steady 
decay, and 1/(2N) corresponds to the smallest eigenvalue of the partial 
differential equation 8.4.1. The relation 8.4.8 was first obtained by Wright 
(1931). Figure 8.4.1 illustrates such a state of steady decay, when fixation or 
loss of an allele occurs at a constant rate. 


T ^ 





0 2596 5096 75% 100% 
x 

Figure 8.4.1 The distribution 
after many generations 
(roughly, any time after t = 2N) 
when the distribution is of 
steady form. All frequencies 
between 0 and 1 exclusive are 
equally probable and are 
decreasing at the same rate, 
1/2N. Fixation or loss of the 
allele proceeds at a constant 
rate, 1/4N. (Adapted from Wright, 
1931.) 


The complete solution, 8.4.3, enables us to construct the more detailed 
process of change in the frequency distribution of unfixed classes as shown 
in Figure 8.4.2a and 8.4.25. In Figure 8.4.2a, the initial gene frequency (p) 
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is 0.5. It may be seen from the figure that after 2N generations the distribution 
curve becomes almost flat and the genes are still unfixed in about 50% of 
the cases. In Figure 8.4.25, the initial gene frequency is assumed to be 0.1 
and it takes 4N or 5N generations before the distribution curve becomes 
practically flat. By that time, however, gene A, is either fixed in the population 
or lost from it in more than 907, of the cases. So the asymptotic formula 





(a) (5) 


Figures 8.4.2a,b. The process of random genetic drift due to small population 
number, in which it is assumed that mutation, migration, and selection are 
absent, and the random change in the gene frequency from generation to 
generation is caused by random sampling of gametes. In Figure 8.4.2a, 

the initial frequency of A, is 0.5, while in Figure 8.4.25, the initial 

frequency is 0.1. [n both figures, ¢ stands for time and N stands for the 
effective population number. The abscissa is the frequency of A, in the 
population and the ordinate is the corresponding probability density. 

(From Kimura, 1955a.) 


8.4.8 may not be as useful for p = 0.1 as in the case p = 0.5. Actually, from 
8.4.7, C = 6p(1 — p) and this constant is small if p is near 0 or 1. Going back 
to the complete solution 8.4.6, the probability, O,, of both A, and A, 
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co-existing in the population in the tth generation may be obtained by inte- 
grating (p, x; t) with respect to x from 0 to 1. This gives 


.QGjt1X2j*2» 


Q, = 2. (Par) — Pi .2(r)je an, B.4.9 
j= 


where P(-) represents the Legendre polynomials; Po(r)=1, P,(r)=r 
P,(r) = (1/2X38r? — 1), etc. The above formula is an infinite series, but for a 
large t, we may use the following formula to compute the value of Q,: 


: 6 
Q, = 6p(1— pje ?* + 14p(1 — p(1— 5p + Spe 2N +, 9^7 


The frequency of heterozygotes or the probability that an individual in 
the population is heterozygous at a given generation can also be obtained 
by using 8.4.6 as follows: 


1 t 
H, = | 2x(1 — x)p(p, x; t) dx = 2p(1 — p)e 24, 8.4.11 
0 


This shows that the frequency of heterozygotes decreases exactly at the rate 
of 1/(2N) per generation. It agrees with 7.3.7 which was obtained by an 
elementary method. Actuallv, this holds also for multiallelic cases and is 
independent of the number of alleles involved. 

The above treatment should have made it clear that the genetic hetero- 
geneity of a population and the heterozygosity of an individual are not only 
distinct conceptually but also their probabilities Q, and H, are different. 

The processes of change in the probability distribution of fixed classes 
may be obtained by using relations 8.3.21 and 8.3.23. The frequency of the 
class in which gene A, is fixed, or the probability that 4, becomes fixed by 
the tth generation, is as follows: 


l t 
f(p, ies (p, 1; t) dt 


l Lad i(i * 1)t 
ADO p gee e o san 


iti t 1)t 


= p+ ¥ (21-4 DI- DF 2,1— i, 2 p De Sm. 





Similarly, the frequency of the class in which A, is lost may be obtained by 
integrating (p, 0; t)/(4N), or more simply, by replacing p with (1 — p) in 
the above expression for f(p, 1; t). With these expressions, it can be shown 
that 


f(p,0:r - Q, -f(p.1:n)-1 
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and, at the limit of t = oo, we have 
f(p,0;o)21- p, Q.,-0, f(p, 1; 0) = p. 8.4.13 


We might very roughly characterize the above process of random genetic 
drift by the following example. If we start out with 1000 populations, each of 
size 100 individuals and containing 50% of gene A,, then after about 200 
generations A, is either fixed or lost in roughly 500 populations. In the 
remaining 500, the distribution of the frequency of A, is practically flat. 
When this state is reached, 1/200 of the unfixed populations become fixed 
for A, or its allele each generation from that time until eventually every 
population will be homogeneous for either A, or its allele. 

In his first treatrhent of the process of random genetic drift, Fisher 
(1922) used the transformed gene frequency rather than the gene frequency 
itself. A main reason for this is that if the gene frequency is transformed 
from x to 0 by the relation (angular transformation) 


0 = cos! (1 — 2x), 8.4.14 


the sampling variance of gene frequency per generation becomes roughly 
independent of the gene frequency, where 0 in radians changes from 0 to x 
as x changes from 0 to 1. This may be seen as follows: From the above 
relation, we have 


oda eus ned aE Pt ais 
“Taio 340-3899 t7 2 


where ôx is the amount of change in x per generation and óÓ is the corre- 
sponding change in 0. Then, if we note that 


ó0 


x(1— x 
Ms» = E(óx) = 0, V;, = E{(dx)*} = USE 8.4.16 
2N 
we obtain, after neglecting higher order terms, 
M5 = t0 
86 = — aN co 
8.4.17 
TM 1 
9 2N 


It follows then that if a population starts from a fixed gene frequency p, the 
variance of 0 after t generations is given approximately by 


t 
V,(t) = . 8.4.18 
ft) 2N : 


388 AN INTRODUCTION TO POPULATION GENETICS THEORY 


We should note here that the above formula for V, is valid only when t is 
much smaller than N. Also, we should note that the expected value of 50 
now depends on 6, since in calculating E(ô0) from 8.4.15, the second term 
in the right-hand side is nonzero and cannot be neglected even if the first 
term is zero. Fisher (1922) neglected Mj and wrote the differential equation 
for the probability distribution using only Vj, = 1/2N. This led him to the 
incorrect result of 1/AN as the rate of steady decay. Later (1930) he incor- 
porated Ms = — (cot 0)/4N into the equation to obtain the correct result. 

Nevertheless, this type of transformation which makes sampling variance 
nearly constant is rather convenient for treating data on random genetic 
drift over a relatively short period or if the gene frequency is restricted to a 
range not very far from 0.5. 

So far we have considered a pair of alleles A, and A,. With more than 
two alleles the situation is of course more complex, but the same principles 
apply. Figure 8.4.3 shows the steady-state situation for three alleles. At this 





A? 


Figure 8.4.3. The distribution for three 
alleles at a steady state under random 
drift. (From Kimura, 1955b). 


state the rate of change from populations with three alleles to populations 
with two alleles is 3/2N per generation. Then each of the populations with 
two alleles changes to a population with one allele at a rate of 1/2N, as 
shown above. At the same time the rate of fixation is 1/2N for each of the 
three alleles, so that every generation 3/2N of the populations becomes fixed 
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for one of the three alleles. The extension to more than three alleles follows 
naturally. The rate of change, when a steady state has been reached, from k 
alleles to k — 1 alleles is k(k — 1)/4N per generation, where N is the effective 
population number. This result is from Kimura (19555). 

The problem becomes much more difficult if we consider two linked loci 
that are segregating simultaneously. Although a theory comparable to that of 
single locus has not been developed, the amount of linkage disequilibrium 
caused by random sampling of gametes in a finite population has been 
clarified by Hill and Robertson (1968) and also by Ohta and Kimura (1969). 
Let us assume that a pair of alleles A, and A; are segregating in the first locus, 
and B, and B, in the second locus. If we denote by X,, X,, X}, and X, the 
respective frequencies of the 4 types of chromosomes A, B,, A,B, , A2 B,, and 
A,B,, then D = X,X, —X,X, represents the amount of linkage disequilbrium. 
Hill and Robertson (1968) showed that in a small population E(D?), that is 
the mean square of D, may become large even if E(D), the mean value of D, 
is 0. Using the method of moment generating matrix, they obtained analytical 
expressions for the 3 quantities, E(x(1 — x)y(1 — y)}, E(D(1 — 2xy1 — 2y)}, 
and E(D?) in the case of no crossing-over, where x and y are respectively the 
frequencies of A, and B, in the first and second loci. Ohta and Kimura (1969) 
obtained more general expressions for an arbitrary recombination fraction c, 
based on the diffusion models. An interesting property first discovered by 
Hill and Robertson is that the expectation of r? = D?/(x(1 — x)y(1 — y)) 
settles down quickly in the process of random drift to a constant value which 
depends only on N,c. Note that r is the correlation coefficient between gene 
frequencies at two segregating loci. They also inferred from simulation 
studies that E(r*) approaches l/(4N,c) as N,c increases. Ohta and Kimura 
(1969) considered a quantity o2 = E(D?)/E(x(1 — x)y(1 — y)) and showed 
that it takes a value similar to E(r?). They obtained an analytical expression 
for o2 and showed that 


6l = MANUS) 


for a large N,c. 

It is interesting to note that a relation similar to this holds for the case in 
which a steady state is reached with recurrent mutations and random genetic 
drift as shown by Ohta and Kimura (19694). 


8.5 Change of Gene Frequency Under Linear Pressure 
and Random Sampling of Gametes 


In the previous section, we have considered the process of random drift 
caused by random sampling of gametes alone. We will now investigate the 
process in which the effects of mutation and migration are also included. 
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Since the effects of mutation and migration on the rate of change in gene 
frequency are linear functions of the gene frequency, we may call them 
collectively linear pressure. 

Let us consider a random-mating population of effective size N, in 
which a pair of alleles 4, and A, are segregating with respective frequencies 
x and | — x. If we suppose that this population exchanges individuals with 
another population at the rate m per generation, then the rate of change 
in x due to this cause is 


m(x; — x) 


per generation, where x, is the frequency of A, in the immigrants (cf. 6.5.1). 
Here we will assume that x, is a constant. This may be a good approximation 
if the immigrants represent a random sample from the entire species. If 
mutation rates are not negligible, we may replace m by m + u + v and mx, by 
mx, + v, where u and v are respectively the mutation rates of A, to and from 
its allele 45. Though the pressure of selection is intrinsically nonlinear, in 
certain cases, like selection acting at the neighborhood of the equilibrium gene 
frequency, it may be treated as if it were linear with good approximation. 
However, the range of applicability is quite restricted. 

The change of mean, variance, and the higher moments of the distri- 
bution of gene frequency under linear pressure and random sampling of 
gametes can be worked out by applying the method we used in Chapter 7, 
Section 4 for the case of random sampling of gametes alone. In the present 
case we note that 


E(x) = m(x; — x,) 8.5.1 


rather than £,(dx,) = 0 as in 7.4.5, where x, is the value of x at the rth 
generation. For the mean, we obtain 


up = (1 = my + mx, 
which leads to 


dpi? 
dt 


ft 


= —m(u? — xj) 





for the continuous model. The solution of the above equation gives the mean 
gene frequency at the tth generation: 


sir) —mt 


ui o —xpt(p-xpe ™, 8.5.2 


where p is the value of x at t = 0. Similarly, we can work out the second and 
the higher moments step by step. The general formula for the nth moment 
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of the gene frequency around the origin is as follows (Crow and Kimura, 
1956): 
un - (^) EE ee 
Ha Tr 


il (A t- n t i (B t (A — BI(A + 2i — 1) xs 





-l 
— l, —i, A — B, 1 — — 
x F(A +i i, p) exp| im + T aN, \ | 


In the above c ation, 4 =4N,m, B=4N, mx, (l > x, > 0), p is the initial 
frequency ws 4,, and F(-,-, °, <) denotes the hypergeometric function. 

Since in thus case the mean and variance of the rate of change in gene 
frequency x are respectively 


M, = n(x; — x) 8.5.4 

and 
x(l — x) 
V, =y ^C 3. 
bx 2N, 8.5.5 

the forward n 8.3.1 becomes 

op — l 

t ~ AN, ax 4.2 {x(1 = x)} - "ns (x, E x)6). 8.5.6 


where @ = $(p, x;1) is the probability density that the frequency of A, 
becomes x at the /th generation, given that it is p at t = 0. The initial con- 
dition for the equation is 


é(p, x; 0) = d(x — p). 8.5.7 


The moment formula 8.5.3 suggests that the solution to the above equation 
must have the form 


O(p, x; i) = » X (x) exp| - i aL) i, 8.5.8 





By comparing 
H 
n? = | x'ó(p, x; t) dx 
re 
with 8.5.3 we can get the appropriate expression for $, which turns out to be 
the pertinent solution of 8.5.6. It is given by 8.5.8 in which 
X(x) = x !(1— x^" ^!F(Ai—1, Ci, A— B, 1 — x) 
x F(A +i-1, —i, A— B,1 — p) 


I(A — B + i)I (A + 2i)T(A + i — 1) 8.5.9 
i'T^(A — BOT (B + i) (A + 2i — 1) 
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At t > oo, our formula 8.5.8 converges to Wright's well-known formula 
for the steady-state gene frequency distribution under migration: 


T(4ANm) 
T(4Nmx)I(4Nm( — xp) 


eumd x x) mc AD- 1 


$(p, x; 00) = 


8.5.10 


Figures 8.5.1a, 8.5.1b, and 8.5.1c show the asymptotic behavior of the dis- 
tribution curve for three different cases: 4N,m = 02, 4N,m — 2, and 
4N,m = 6. In all the three cases illustrated the gene frequency, x,, of the 
immigrants is 0.5 and the initial gene frequency, p, of the population is 
assumed to be 0.2. We will study the nature of steady-state distribution 
(8.5.10) in some detail in the next chapter. In the above treatment, it has 
been assumed that x; is neither O nor 1. In terms of mutation pressure alone, 
this corresponds to the case of reversible mutation for which u > 0 and v > 0. 

Next, we will investigate the case of irreversible mutation. Let us assume 
that A, mutates to A, at the rate v per generation (v > 0) but there is no 
mutation in the reverse direction (u = 0). If x, is the frequency of A, at the 
Ith generation, then the amount of change in x, in one generation is 


6x, = v(1l — x) + 6, 8.5.11 


where č, is the amount of change due to random sampling of gametes with 
mean and variance 
x,(1 z x,) 

E(£)-20, E,(€7) = —2N, ^ 8.5.12 
Using the same procedure we used to derive 8.5.3, as well as 7.4.37 in the 
previous chapter, we obtain the following formula for the nth moment of 
the gene frequency distribution about origin in the tth generation: 


TORRE MESS MITES I(c-4 n)(c-i- 1)(c + 2i-1) 
Henne » È (y p T(c)T(c +n +i) 


i-—1 
x F(1— i, i+c,c, {-i( + y) 
( i, i p) exp; —ilv aN, 
where c = 4N,v, and exp {+}, I (^), and F(-, -, +, +), respectively, denote the 
exponential, the gamma, and the hypergeometric functions. 
In particular, the first two moments are 


8.5.13 











ui? =1-— (1 — pe" 8.5.14 
and 
2c +2 c 1 
"dtd e * —(1— _ | -i(v t 
H2 ( p) PEE ( pir» PEUT, e 4N,/ . 8.5.15 
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2.0 


4 Nm = 02 





0.5 
0 0.5 1.0 


{b} (c) 


Figures 8.5.1a,b,c. Asymptotic behavior of distribution curves for a finite 
population with migration or other linear pressures. In all three drawings, 
the gene frequency of the immigrants is assumed to be 0.5, and the initial 
frequency in the population 0.2. The abscissa is the gene frequency x; 

the ordinate is the probability density $. N: population number. m: rate 

of migration. (From Crow and Kimura, 1956.) 
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The gene frequency distribution ¢(p, x; t) which satisfies the forward equation 


0$ 1 e? ð 

r7 AN. {x(1 — x)9) ^ v ox {(1 — x)$) 8.5.16 
can be worked out as in the case of reversible mutation. The pertinent 
solution is as follows. 


8 (c 1-4 (C+ 14 DE(c + i) 
$p x D= —— mW por 


x(1— p)F(—i,i +c + 1,c, p) 


-1 zs _ fit Dit Dr 
xx" F(-ii-c-c-cl,c,x)e 4Ne 


=c(c + 1)(1 — p)x* le^" 8.5.17 
^(c +1 3 2 
+ ERUERA - p)(1 ELE. pe 
2 c 
x (1 Se) 0m) Tc, 
c 


where c = 4N,v and 0 < x « 1. 
For a very large t, we have the asymptotic formula 


(Pp, x; t) ~ AN v(AN,v + 1)(1 — p)x*""7!e^"t, 8.5.18 


At this state, the distribution curve for intermediate gene frequencies decays 
at the rate of v per generation, as shown by Wright (1931). 

The probability that 4, becomes fixed by the tth generation or the fre- 
quency of the terminal class with x = 1 can be obtained by letting n — oo 
in uj (9. This yields 


E (8, wea Ei 19 ee 2i — 1) 
qi)studheprpope c ee 





i—-1 
F(l —i,i+c,c, p)exp\—i t 
x F( c,c, p)e Pf io ax) 


(c+ 2c+ 3m : 8.5.19 


5 p) 


-1-(c- 1 - pe^" 


1 
x »- nj ta -— 
2+c 
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For a large t, we have 
fl, n) ^1 — (4N,v + 1)(1 — poe" 8.5.20 


showing that A, becomes irreversibly fixed in the population. Also, from 
8.5.17 and 8.5.19, we note that 


1 
fa, N+ [ (px: t dx: 


Figure 8.5.2 illustrates the probability distribution of intermediate gene fre- 
quencies for c= 4N,v = 0.5 at t = 12N,, assuming p = 0. 
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Figure 8.5.2. Gene frequency distribution 
under irreversible mutation for the case of 
c=4N = 0.5, t = 12N. , and p = 0. 
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8.6 Change of Gene Frequency Under Selection 
and Random Sampling of Gametes 


For our understanding of the process of evolution and breeding, it is quite 
important to clarify the process by which gene frequencies change in a finite 
population under selection. However, because of the mathematical diffi- 
culties involved, so far only the simplest case has been solved completely. 
The main difficulty stems from the fact that under selection the rate of change 
in gene frequency is a nonlinear function of the gene frequency. In the present 
section we will summarize the main results obtained by Wright, Kimura, and 
Robertson. 


8.6.1 Genic Selection 


Let us consider the simplest case of no dominance, where the heterozygote 
has fitness midway between the fitnesses of the two homozygotes. We will 
assume that a pair of alleles 4, and A, are segregating with respective fre- 
quencies x and | — x in a finite population of effective size N,. If we denote 
by s the selective advantage of A, over A, or, more precisely, if we assume 
that the selective advantage of A, over A, for a short time interval of length 
ôt is sót, then the mean rate of change in x per generation is given by 


M, = sx(1 — x). 8.6.1.1 
The variance of the rate of change in x is 


xd — x) 


V, 6.1, 
ie IN, 8.6.1.2 


Thus, the forward equation 8.3.1 becomes 


2 

T x an. = (x(1 — x)o} ^ s 2 (x(1 — x)} (0<x <1), 8.6.1.3 
where $ = $(p, x; t) is the probability density that the frequency of A, 
becomes x at the tth generation, given that it is p at t = 0. This equation has 
been used to analyze the gene frequency change in very small experimental 
populations of Drosophila melanogaster (Wright and Kerr, 1954). In that 
paper, Wright devised an ingenious method to analyze the process of steady 
decay in which the distribution curve, while keeping a constancy in form, 
decreases in height at a constant rate determined by the smallest eigenvalue 
(Ao) of equation 8.6.1.3. The complete solution of the problem has been 
given by Kimura (Kimura, 1955; Crow and Kimura, 1956). 
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In order to solve equation 8.6.1.3, let us put 
$ oc e?* V(x) e^, 


where c = N,s and V(x) is a function of x but not of t. If we substitute this 
in 8.6.1.3, we have 
dy dV 5 
x(1 — x) PR + 2(1 — 2x) E {2 + 4c x(1 x) - ANLA)V =0. 8.6.1.4 


x? 


Then by the substitution 


(l-z) 
x dn 8.6.1. 
2 5 
equation 8.6.1.4 becomes 
dv dV 
(1 — 27) —, — 4z — + {(4N,A — 2 — c?) + c?z?)v =0, 8.6.1.6 
dz dz 


where z = 1 — 2x (—1 <z < 1). This type of differential equation is known 
as the oblate spheroidal equation. Actually, 8.6.1.6 is a special case of the 
oblate spheroidal wave equation, 


(1 — z?)V" — 2(m + DzV' + (b + c?z?)V =0, 8.6.1.7 


with m= 1 and b = 4N,A — 2 — c^. We want here the solutions which are 
finite at the singularities, z = +1, and reduce to the Gegenbauer polynomial 
if there is no selection, that is, if s = 0 and therefore c = 0. This is 
because if there is no selection the process reduces to that of random drift 
which was discussed in 8.4. Such a solution has been studied by Stratton 
et al. (1941) and is expressed in the form 


VaX2)-2 Y firi). 8.6.1.8 
n=0.1 


where k = 0, 1, 2, ... (corresponding to / in the notation of Stratton et al.). 
In the above expression, fs are constants and T1(z)’s are the Gegenbauer 
polynomials defined in 8.4.5. The primed summation is over even values of n 
if k is even, odd values of n if k is odd. 

The desired solution of 8.6.1.3 is given by summing V{J)(z) for all 
possible values of k, after having multiplied through by e^?** ^, where A, 
is the kth eigenvalue. 


(P, x; t) =} Ci ge tz YC) 8.6.1.9 
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The coefficients, C,'s, can be obtained from the initial condition 


(p, x0) = d(x — p), 8.6.1.10 
by using the orthogonal relations of the eigenfunctions; 


(n +2)! 


[a - vivir dz = 842 Y TIU 


They are given by 


joco 2 eG) 
k f (n + 1)(n + 2) (y , 8.6.1.11 


a50,01 (2n 3) 


where r — 1 — 2p, c — N,s, and the primed summation is over even values 
of n if k is even, over odd values of n if k is odd. 

The solution 8.6.1.9 with coefficients given by 8.6.1.1] gives the prob- 
ability distribution of unfixed classes in the tth generation. As £ increases, 
the exponential terms in 8.6.1.9 decrease in absolute value very rapidly, and 
for large t only the first few terms are important. The numerical values of the 
first few eigenvalues Ap, A,, and A, can be obtained from the tables of the 
separation constants (B,,,) which are listed in the book of Stratton et al. 
(1941). This is done by using the relation 


4NA, = c — B,,,. 


The boundaries x = 0 and x = 1 act as absorbing barriers and the smallest 
eigenvalue Ag gives the ultimate rate of decay of the frequency-distribution 
curve for unfixed classes; that is, 


and has special importance. It is equivalent to K in Wright and Kerr (1954). 
For small values of c, the eigenvalues can be expanded into a power series 
in c and we obtain the following formula for A) (Kimura, 1955): 


2 2 24 
2NAg = 1 += ———c* a 


CES Oa Ec EE 
2x31 , 17507389 m 
— CC ru —————————————À| Lf. 
56 x 7^ x Il 2x3x9xTxllxlà" 
8,6.1.12 
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or in terms of 2c = 2N, s, it is also given by 


2N.sS? (2N.s)* 2N,s)Ó 
2N, Ap = 1+! eS)" (2Nes)" QN.) 
10 7,000 . 1,050,000 8.6.1.13 


— 4.108 x 107 (2N, s)? — 7.869 x 10^ ! (2N, s)!? -- 


It is impressive to see that Wright's empirical formula (Wright and Kerr, 
1954, p. 236) is exact up to the coefficients of the order of 10^?. For the 
purpose of numerical calculation the above power series is not suitable 
except for small values of c. According to Stratton et al. (1941), convergence 
of the power series is usually slow and it proves unsatisfactory for values of 
c? > 1. However, in the present case (m = 1, / = O in the spheroidal function), 
the above series gives the right answer to three significant figures even for 
c = 3. This agrees with the statement of Wright and Kerr (1954). Figure 
8.6.1.1 gives the relation between c and 2N4,. More exact values are listed 


Genic Selection 





—> 2N `o 




















o 


10 20 30 40 5.0 6.0 7.0 8.0 
—» Ns 

Figure 8.6.1.1. Relations between rate 
of steady decay (Ao) and intensity of 
selection (s) in the process of genic 
selection in finite populations. N is 
effective size of population. (Kimura, 
1955). 


in Table 8.6.1.1. From Figure 8.6.1.1 it looks as if 2N4, increases linearly 
with c for large value of c, though having no proof we are not certain about 
it. The eigenfunctions V{}(z) corresponding to 4,'s are given by 8.6.1.8. The 
coefficients f* corresponding to the first three eigenvalues are found in the 
tables of Stratton et al. (1941, pp. 116, 118, and 120). It will be noted here 
that for c = 0, all the formulae given above reduce to the ones for the case 
of random drift studied in 8.4. 
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Table 8.6.1.1. Relation between c( — Nes) and 2NA, . (From Kimura, 1955.) 


c 2NeAo c 2N. Ào 
0.0 1.00000 3.5 5.43183 
0.5 1.09985 4.0 6.54540 
1.0 1.39765 4.5 7.66121 
1.5 1.88771 5.0 8.75330 
2.0 2.55927 6.0 10.85728 
2.5 3.39445 7.0 12.89983 
3.0 4.36529 8.0 14.91989 


Note: In the new table of spheroidal wave functions by Stratton et al. (1956), t 
is tabulated for c (denoted as g in the table) up to c = 8.0 (pp. 506-508), from 
which 2NA, can be calculated by the relation 


3: dies 
Nay = 1+ (2-5): 


The eigenfunction V{})(z) corresponding to the smallest eigenvalue A 
is of special significance, since it gives the frequency distribution of unfixed 
classes at the state of steady decay, when it is multiplied by e°“. It is 
expressed by 


VRZ) = f$ T2) + FITI) + fg THz) ++. 8.6.1.14 


The coefficients fo, f$, f$, etc., depend on c. When c = 1.0, for example, 
fà = 1.0208, f$ = 0.013980, f? = 0.000096, etc. 

Figure 8.6.1.2 illustrates the distribution at steady decay of unfixed 
classes for some values of c. The area under each curve is adjusted so that it 
is unity. The case c = 1.7 corresponds to the case experimentally studied by 
Wright and Kerr (1954) and the present result agrees quite well with theirs 
(including the rate of decay). The rate of fixation of 4, may be calculated 
from 


df(p, 1; t) ep, 1; 1) 


Ji AN, C 8.6.1.15 


as shown in 8.3.23. Here f(p, 1; t) stands for the probability that 4, has 
become fixed by the tth generation. Similarly, for the rate of loss (or fixation 
of A,), we have 


df(p.O;)) (p 05) 


dt 4N, 8.6.1.16 
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Figure 8.6.1.2. Frequency distribution 

of unfixed classes at the state of steady 

decay for various values of Ns. The 

area under each curve is adjusted so 

that it is unity. Numerals beside the ‘ 
arrows indicate rates of steady decay. 

N is effective size of population; s is 

selection coefficient. (From Kimura, 

1955.) 


8.6.2 Case of Complete Dominance 


Let us suppose that A, is completely dominant over A, and the dominant 
genotypes A,A, and 4,4; have selective advantage s over the homozygous 
recessive A, A5. If x is the frequency of A,, then the mean and the variance 
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of the rate of change in x are 





Mg, = sx(1 — x)? 8.6.2.1 
from 5.3.9 and 
Vex = ae 9 8.6.2.2 
2N, 


respectively, so that the probability density $(p, x; t) satisfies the differential 
equation (cf. 8.3.1); 


ap 1 a ð 2 
ar = an Ax OU — x)$) — 5 ax (x(1 — x)*4) 8.6.2.3 


with the initial condition 
ép, x; t) = ó(x — p). 8.6.2.4 


If we apply the transformation 


o= exp| - 2 t x (1 — =) 


8.6.2.5 
l-z 


2 





x= 


to the differential equation 8.6.2.3, we obtain the ordinary differential 
equation 
c c? 
(1 — z?)w" — 4zw' + {A -2- 5 (z —1)+ 4g — D + aw = 0, 
8.6.2.6 


where A = 4N, À and c= N, s. 
As in the previously treated case of genic selection (sec. 8.6.1), we try 
to expand the solution into a series of Gegenbauer polynomials. Let 


w=) d,T}(z), 8.6.2.7 
n=0 
where d,’s are constants. If we substitute this into 8.6.2.6 and repeatedly use 


the recurrence relation 


n+2 
2n+3 


in which we set 


T!,(z) =0, 


2T,(z) = 





n+1 
T,-1(z) + 2n 43 Ti. i2). 
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we obtain a nine-term recursion formula for the d,'s. Now we expand A 
and the d,'s into power series of c: 


A-kgc kc kc kac) +t, 
d, — (alc - alc? c adc? 9: de, 
d, = (alc alc? alc? +°-') do, 
d, = (xf? + e talch 9) ds, 
d, = (xc? - a$c -:::)d,, 
d, = (a3c? +*+) do, etc., 
and substitute these into the recursion formula. By picking out coefficients 
of equal power of c, we can determine the k’s and a's, by means of which 
the eigenvalue Å (or A) and the eigenfunction w are expressed. The most 
important information is the smallest eigenvalue (Ag), which gives the rate 
of steady decay, and the corresponding eigenfunction. To get Ag, we set 
ko = 2, since for c = 0, A(=4N, A) should be 2, as shown in the treatment of 
pure random drift in which the final rate of decay is 1/2N, (see 8.4). 
Though the calculation involved is quite tedious, we can obtain the 
desired coefficients step by step. For the smallest eigenvalue Àg we get 
(Kimura, 1957), 
1 " 199 24 17 à 
— ———————— a c —————— 
5 2x5 x7 2x5?x7 
23 x 41 x 29599 


2Ne Ao = 1 = 


4 


— = “ - 8.6.2.8 
Px x 5x Px il 
= 1 —(0.2)c + (0.113714 -: )c? 
+ (0.000388 - - -)c? — (0.002191 ---)c* —--+. 
The coefficients of the eigenfunction are as follows: 
1 11 
1 _ Due. = 
eae 2x3x7 3 3*x53x7 
1 13 
t= — — a. 9 = ae 0 & - —7.31 x 1079, 
E Ix3x5 3 31567 x 8.6.2.9 
1 
a 


^ 2x3x5x7T ^" 
a5 7 2.49 x 1074, etc. 


The same method may be applied to get similar expansions for other eigen- 
values and eigenfunctions. 
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The gene frequency distribution at the state of steady decay is given by 


x 
(x)= exp|2ex( -— =) jo 8.6.2.10 
It will be convenient to adjust d, so that 


[de dx - 1, 


from which the fixed classes are excluded. The rate of fixation and loss of 
the gene A, per generation at this state is then given by $(0)/4N, and $(1)/4N, 
and therefore 

AN, ào = $(0) + (1). 8.6.2.11 
This follows from 8.3.21 and 8.3.23. 


A numerical example will be given here. For weak selection favoring 
the dominant, N, s = c = 1/2; we get from 8.6.2.8, 


2N, Ao = 0.928 
and from 8.6.2.9, 
wg = do{T}(z) — 0.0058 T 1(z) — 0.0028T3(z) 
+ 0.0004 T1(z) + 0.00006 T 1(z) + °°}, 


inwhichTj(z) = 1, T1(z) = 3z, T(z) = (3/2) (5z? — 1), T3(z) = (5/2) 7z? — 32), 
etc. Values of d(x) at 0, 0.1, 0.2, ..., and 1.0 are listed in Table 8.6.2.1 and 
the curve is illustrated in Figure 8.6.2.1. They are adjusted by Simpson's rule 


Table 8.6.2.1. Frequency distributions at steady decay with 2N.s=1 and 
2N,s = —1. (From Kimura, 1957.) 


x 2Nes = 1 2N,s = —1 


0.0 0.688 1.389 
0.1 0.764 1.251 
0.2 0.838 1.142 
0.3 0.910 1.056 
0.4 0.977 0.990 
0.5 1.037 0.940 
0.6 1.088 0.903 
0.7 1.128 0.879 
0.8 1.155 0.865 
0.9 1.168 0.860 


1.0 1.166 0.866 
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Figure 8.6.2.1. Curves giving the frequency 
distribution of unfixed classes at the state 
of steady decay for 2N.s = 1.0, 0, and 
—1.0, where s is the selective advantage of 
dominants (4,A, and 4,42) over the 
recessive (42 A2). Ne is the effective size of 
the population. Rates of steady decay are 
indicated beside the arrows. 


so that the area under the curve is unity. ¢(1) + $(0) comes out 1.855, while 
4N, ào is 1.856. The agreement is satisfactory for this level of approximation. 
As a second example, we assume weak selection against the dominants: 
N,s = c = —1/2. 2N, Ag is 1.128 and values of $(x) are given in Table 8.6.2.1 
and are also illustrated in Figure 8.6.2.1. In this case ¢(1) + $(0) comes out 
2.254 while 4N, 4, is 2.256. Again the agreement is satisfactory. A most 
remarkable fact shown by the above examples is that, compared with the 
case of pure random drift, selection favoring the dominant allele (s > 0) 
decreases the final rate of decay, while selection against the dominant allele 
(s < 0) increases it. That this is always true for weak selection follows from 
8.6.2.8, since the most influential term —1/5c is negative if c(—N,s) is 
positive, and positive if c is negative. Formula 8.6.2.8 also suggests that 
2N,Ày has a minimum at about c — 0.9, when c> 0, after which 2N,A, 
starts to increase with increasing c. 
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Miller (1962) has shown that, when c is very large, 
cM 
Ned & 0.718 (5) : 8.6.2.12 


Furthermore, he worked out exact values of N,4, by numerical analysis to 
cover the range of 0 € c € 60 (Miller's « in his Tables 1 and 2 corresponds 
to c/2in the present notation). Table 8.6.2.2 and Figure 8.6.2.2 are constructed 
based on his results. (In Figure 8.6.2.2 the curve for small c is drawn based 
on the expansion 8.6.2.8.) 

For the continuous treatment to be applicable, the population number 
should be fairly large so that its reciprocal is negligible compared with unity. 
If the population is extremely small, we must treat the problem by the methods 
of finite Markov chains. Assuming that the effective number N, is equal to 
the actual number N, the transition probability that the number of 4, genes 
in the population becomes j in the next generation, given that it is į in the 
present generation, will be given by 


Pj= (^7 ea ELM 


(i,j — 0, 1, ..., 2N), 


8.6.2.13 


where x’ = x + 6x, in which x = i/2N and 6x is the change of gene frequency 
by selection per generation and is sx(1 — x)? if s is small. The rate of decay 


Table 8.6.2.2. Relation between c(— N,s) and 2N, Ao in the case of complete domi- 
nance, where s is the selection coefficient for the dominant. This table is constructed 
based on the values given by Miller (1962) in his Tables 1 and 2. 


c 2N. Ào c 2N. Ào 
0 1.0000 20 4.3886 
l 0.9120 22 4.6332 
2 1.0244 24 4.8668 
3 1.2688 28 5.3058 
4 1.5586 32 5.7142 
6 2.0900 36 6.0972 
8 2.5300 40 6.4594 
10 2.9118 44 6.8038 
12 3.2542 48 7.1326 
14 3.5676 52 7.4480 
16 3.8586 56 7.7512 


18 4.1312 60 8.0438 
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3.1 
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1.6 
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1.2 
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1.0 


0.9 
o 2.0 4.0 6.0 8.0 10.0 12.0 


N.S 


2 Nedo 


Figure 8.6.2.2. Relation between the rate of 
steady decay (Ao) and selection coefficient 
(s) in the case of complete dominance. Ne 

is effective population size. 


of the unfixed classes and their limiting distribution can be obtained by 
iteration. For example, if N = 4 and 2Ns = 1, the limiting form of the dis- 
tribution (fixed classes excluded) is shown in Table 8.6.2.3, with rate of decay 
(Ap) 11.875 %, giving 2NA, = 0.9500. If there is no selection (s = 0), it turns 
out that the rate of decay becomes 1/2N = 0.125 or 2NÀ, = 1.0. It may be 
interesting to note that with this magnitude of selection for dominants, the 
rate of decay is smaller than in the case of no selection. This agrees with 
the result of continuous treatment. 
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Table 8.6.2.3. Frequency distribution of unfixed classes at steady decay with N — 4, 
2Ns = 1, where s is the selection coefficient for dominants. (From Kimura, 1957.) 


CLASSES FREQUENCIES 96 
7A, +14: 14.80 
6A, 241 16.48 
541 +34: 16.32 
4A,+ AA 15.51 
34; + 5A; 14.28 
2A,+ 64; 12.64 
141-74: 9.97 
TOTAL 100.00 


8.6.3 Arbitrary Degree of Dominance 


We will consider a more general case in which the selective advantages of 
A,A, and A,A, over A, A; are s and sh respectively. 

The partial differential equation corresponding to 8.6.2.3 in the fore- 
going section is now 


0Ó 1 2? 


w curo — x)ó) 


5 8.6.3.1 
— s — {(h + (1 — 2h)x)x(1 — x)9). 
Ox 


As in the previous cases, the rate of steady decay is given by the smallest 
eigenvalue A), which can be expanded into a power series of c if c is small 
(Kimura, 1957): 


2N,Ao =1 + Kie + Koc? + Ky 9 Kíc* v ns, 8.6.32 
where 
1 1 2? x 3 

Ko ecu PR ee 

eS M ae OS ae Ee 

m 1 2: D? 

5 2x5 x7 55x77 ' 

1 7? 22 x 35 

KS n S AREE Ar a Le Saa a 4 
semna 2373555 eara ceo 


c= N,s, and D - 2h — |. 
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It may be easily seen that the above formula reduces to 8.6.1.12 if there is no 
dominance (A = 1/2), and therefore D = 0, provided that 2s is used instead 
of s to express the selective advantage of A,A,. For the case of complete 
dominance, D = 1 or —1 according to whether the mutant gene is dominant 
or recessive. In the former case of D = 1, 8.6.3.2 agrees with 8.6.2.8, 

The above notation for expressing the selective advantage of 4,4, and 
A,A, over A2 A, as s and sh may not be very convenient for overdominant 
genes, in which case it is preferable to take the heterozygote as standard and 
denote the selective advantages of two homozygotes, 4,4, and A, A, as —S, 
and —s,. The latter notation is related to the former by 


Spes 8.6.3.3 

52 =sh, xS 

D=2th-1= 5; 5;[* 8.6.3.4 
S — $1 


Thus in the symmetric case of s; = s: = $, c = N.s =0 and De = N,sD = 
NAs, + 52) = 2N, 5, and therefore 8.6.3.2 reduces to 


2^ x 3 
2N Am 1 E(N) LT = (Nes)? 


5 26 5 
$5404 — 
= 1 — 0.4(N, 5) + (0.05485 ---)(N, 5)? 

— (0.001462 - --)(N, s)? — (0.000263 ---)(N,s)* — 


(N,3)9* —--- 8.6.3.5 


The power series expansions 8.6.3.2 and 8.6.3.5 are only valid for small 
value of N,(s, + s2). On the other hand, for the case of overdominant genes, 
knowledge of the rate of decay for fairly large values of N (sı + 52) is required, 
and an important contribution has been made by Miller (1962) in evaluating 
the smallest eigenvalues for such a case. His work together with that of 
Robertson (1962) will be presented briefly in the following material. 


8.6.4 Overdominant Case 


Consider a pair of overdominant alleles A, and A,; if we designate by s, 
and s; the selection coefficients against the homozygotes 4,4, and A, A, 
in such a way that the rate of change of the frequency of A, by selection is 
given by 


(52 — (5; + sz)x])x(1 — x), 
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then the equation corresponding to 8.6.3.1 in the previous section becomes 


2 
e a 20-39) - zs — (si + sede xg), esaa 


where x is the frequency of A). 
Let 5 be the average of the two selection coefficients, 


51 +S, 


—$, 8.6.4.2 
2 
and let X be the equilibrium frequency of 4, in an infinite population, 


5 ^ 
2 = ¥, 8.6.4.3 





$1 + 52 
Then equation 8.6.4.1 may be expressed as 


2 
a ebat 


a 
ear ag C0 7200) — ANS = (2G — x)x(1 — x)9} 8.8.4.4 


or, denoting 2X — 1 by 2, i.e., 


X =(2+ 1)/2, 8.6.4.5 
we have 
0 ð? ð 
4N. nd Sa {x(1 — x)$) + 4N,5 2: {(2x — 1 — 2)x(1 — x)p). 86.4.6 


The smallest eigenvalue, Ay, of the above equation has been worked out 
by Miller (1962). Because of the symmetry involved between a pair of alleles, 
it is sufficient to study the case of 


£z0.5 
Or 
2 2 0. 


For a large value of c = N,s, and for the range of 1 > 2 2 0, Miller obtained 
the asymptotic expansion 


c ((1—2)& "79" (1428-0 
2N,À nce. dede yT) .6.4. 
'UTe5lsc1-235 ^ SeA) rud 
where 
so) 21411 1x3 1 


2x owt 8.6.4.8 


STOCHASTIC PROCESSES IN THE CHANGE OF GENE FREQUENCIES 411 


and 
2 Cari +} Co 
zmay 
E er a Jn 
in which 


and C;'s are given by the recurrence relation 
(1 TE 2 Citi = 22C; + Ci-15 


starting from 


In the special case of 2 = 0 (i.e., 5; = 52), we get 
(2 2de* 
US Jat OY! 


Table 8.6.4.1. Values of 2N, Ao as a function of c( 


is based on the values given by Miller (1962) in his Table 1. 


8.6.4.9 


8.6.4.10 


8.6.4.11 


8.6.4.12 


8.6.4.13 


— Ns: + 52)/2) and £. This table 





0.0 1.0000 1.0000 1.0000 
0.5 0.8136 0.8174 0.8292 
1.0 0.6532 0.6682 0.7130 
1.5 0.5172 0.5482 0.6408 
2.0 0.4038 0.4528 0.5982 
3.0 0.2358 0.3154 0.5516 
4.0 0.12984 0.2230 0.5082 
5.0 0.06770 0.15656 0.4550 
6.0 0.03362 0.10780 0.3964 
7.0 0.016004 0.07242 0.3384 
8.0 0.007362 0.04750 0.2840 
9.0 0.003294 0.03046 0.2352 
10.0 0.0014410 0.01918 0.19230 
11.0 0.0006192 0.01186 0.15562 


12.0 0.0002622 0.00722 0.12472 


1.0000 
0.8382 
0.7466 
0.7096 
0.7060 
0.7258 
0.7256 
0.6990 
0.6558 
0.6044 
0.5498 
0.4948 
0.4414 
0.3910 
0.3440 


1.0000 
0.8490 
0.7876 
0.7932 
0.8360 
0.9352 
0.9930 
1.0110 
1.0032 
0.9790 
0.9444 
0.9030 
0.8578 
0.8100 
0.7614 


1.0000 
0.8766 
0.8916 
1.0038 
1.1590 
1.4532 
1.6730 
1.8374 
1.9648 
2.0656 
2.1460 
2.2106 
2.2622 
2.3032 
2.3352 
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Also, if 2N,(s, + 5;(1 — €)? is large, formula 8.6.4.7 can be reduced to a 
simpler form: 


Ns. 4. Y 
2N,À, = ERN (2x(1 — £)?e- €? 4. 22(1 — Re 48}, 
8.6.4.14 


Miller also worked out by numerical analysis the eigenvalue for various 
values of c (20) up to c = 12. In Table 8.6.4.1, 2N, A, is tabulated as a func- 
tion of c and X based on the results obtained by Miller. Figure 8.6.4.1 is 


2.50 
2.40 
2.30 
2.20 
2.10 
2.00 
1.90 
1.80 
1.70 
1.60 
1.50 
1.40 
1.30 
1.20 
1.10 
1.00 


0.90 * 
0.80 x= 0.8 


x= 0.9 


2N do 


0.70 
0.60 -4 
0.50 

ee 
0.304 
2d 
0.10 


Q 








01234 587 B 9 10 11 12 
c 


Figure 8.6.4.1. 2N. A, for overdominant 
genes is plotted against (= Ns, + 52)/2) 
for various values of equilibrium gene 
frequency X. 
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drawn from the values in Table 8.6.4.1 for the cases of £= 0.5, 0.6, 0.7, 
0.8, and 0.9. One of the most remarkable features disclosed from the figure 
is that if s; and s; differ to such an extent that the equilibrium frequency £ 
is higher than 0.8 (or, because of symmetry, less than 0.2), overdominance 
accelerates rather than retards fixation, as compared with the neutral case. 
This was first discovered by Robertson (1962), who presented this fact in 
the form of a graph shown in Figure 8.6.4.2, where the term retardation 
factor is defined as the reciprocal of 2N, åo. According to him, selection for 
the heterozygote is a factor retarding fixation only if the equilibrium fre- 
quency € lies inside the range 0.2-0.8. For equilibrium gene frequencies 
outside this range, there is a range of values of N,(s, + 52) for which hetero- 
zygote advantage accelerates fixation and the more extreme the equilibrium 
frequency, the wider this range. However, for all values of € except O or l, an 
increase in the values of N,(s; + 52) eventually leads to a retardation. Table 
8.6.4.2 shows values of N,(s, + 52) which give retardation by a factor of 100 
for various equilibrium gene frequencies (Robertson, 1962). 


10? 60 60 
N(s, + 54! 

10? 

10! 


10° 


Retardation Factor 


107° 





0 0.5 1 
Equilibrium Gene Frequency 


Figure 8.6.4.2. Graphs showing retardation factor as a 
function of equilibrium gene frequency for various 
values of N(s; + 52). (From Robertson, 1962.) 


414 AN INTRODUCTION TO POPULATION GENETICS THEORY 


Table 8.6.4.2. Values of N.(s; + s2) which give 2N. Ao = 0.01 for various equilibrium 
gene frequencies. (From Robertson, 1962.) 


0.05 0.1 0.2 0.3 0.4 0.5 


bat 


Né(s1 + S2) 2100 490 110 45 22 15 


8.7 Change of Gene Frequency Due to Random 
Fluctuation of Selection Intensities 


So far we have considered stochastic processes of gene frequency change due 
to random sampling of gametes. In the present section, we will investigate a 
similar process due to random fluctuation of selection intensities. 

Such a fluctuation of selection intensities may be caused by random 
change in environment affecting large and small populations equally, as 
suggested by Fisher and Ford (1947) and by Wright (1948). It may also be 
caused by random fluctuation of genetic background associated with indi- 
vidual genes as a result of random sampling of gametes. If the number of 
segregating loci is much larger than the number of individuals in a population, 
the second cause may be quite important. However, in this section, to 
simplify the problem, we will assume that the population is infinitely large 
and the effect of random sampling of gametes may be neglected. 

Consider a pair of alleles A, and A, with respective frequencies x and 
| — xin the population, and assume that 4, has selective advantage s over A,. 
Then the amount of change in x per generation for a given value of s is 


ôx = sx(l1 — x). 
If s is a random variable having mean § and variance V,, then 

M,, = E(óx) = E(s)x(1 — x) = sx(1 — x) 8.7.1 
and 

V, = var(s)x?(1 — x)? = V,x*(1 — xy. 8.7.2 


With these expressions for M,, and V;,, the forward equation 8.3.1 becomes 


37955350053 $)- 5 G1 = 3) (0 « x « 1). 8.7.3 
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The solution of the above equation has not been obtained, but in the special 
case of $ = 0, namely, when gene 4, is selectively neutral on the long-term 
average, so that 


ð V, e 
1 x?(1 — x9) (0<x « 1), 8.7.4 
the solution can be obtained as follows (Kimura, 1954). 
Letting 
Vi 
u=4 exp [reta - xo 8.7.5 
and 
x 
€ = log, (=) (— œ < € < oo), 8.7.6 
—x 


and substituting these in 8.7.4, we obtain the heat conduction equation, 


Qu V, @u 
ôt 20tU 


8.7.7 


It is known that this equation has a unique solution which is continuous over 
—oo to +œ when t=O and which reduces to u(č,0) when t 20. The 
solution is given by 











f e E u(n, 0) dy. 8.7.8 


—awa 


1 
u(č, t) bas J2nV,t 


Therefore the required solution of 8.7.4, when the initial distribution of gene 
frequency is ¢(x, 0), is given by 


. mA 
X 
1 Pig 








p(x, t) = I fro £6 
T l^ 
J Vet {x( x)} 8.7.9 
( x(1 — D) 
1 og 
x | exp) — Ó— / X1 — y)óC», 0) dy, 


where exp {-} denotes the exponential function. 
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If the initial condition is not a continuous distribution but is a fixed 
gene frequency p, such that $(x, 0) = d(x — p), the above formula becomes 





[los Vou 
$(p, x; t) = l exp eee ai a 2 
B Vaki 2t 8.7.10 
[r(1 — p) Y 


* Tx — 3]? 


giving the probability density that the frequency of A, is x at the tth genera- 
tion, given that it is p at t — O. 

The change in the distribution curve with time is illustrated in Figure 
8.7.1, assuming p = 0.5 and V, = 0.0483. As the figure shows, the distribution 
curve is unimodal for a considerable number of generations (in this case, up 
to the 27th generation), after which it becomes bimodal. Generally, with 
an arbitrary positive value of V,, the distribution curve is unimodal if the 
number of generations is less than 4/(3V,) but becomes bimodal if it exceeds 
this value (still assuming p = 0.5). In the 100th generation, gene frequencies 
in our example which give maximum probability (corresponding to peaks in 
the curve) are approximately 0.0007 and 0.9993, where the height of the 
curve is about 11.37. As time goes on, the distribution curve appears more 
and more U-shaped, though it is never truly U-shaped, since its value at 
either terminal is always 0. This means that as time elapses the gene fre- 
quency shifts toward either terminal of the distribution (x =0 or 1) indefi- 
nitely and accumulates in the neighborhood just short of fixation or loss but 
never goes to fixation or loss completely (at least theoretically). To distinguish 
this from the fixation or loss in the case of random drift in small populations, 
the terms **quasi-fixation " and *quasi-loss" have been proposed (Kimura, 
1954). 

The same problem may be treated using a discrete model as follows: Let 
x be the frequency of A, in the fth generation and let s be the selection 
coefficient against its allele 4, such that the selective values of 4,4,, 4,4;, 
and A; A, are respectively 1, 1 — s, and (1 — 5). Then we have 


(t) 





x 
(e+ 1) _ 
x S sl — x) 8.7.11 
Or 
xt) x l 


airm = g * ———. 8.7.12 
px" ex 3125s 
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Figure 8.7.1. The change in the gene frequency distribution 
under random fluctuation of selection intensities. In this 
illustration it is assumed that the gene is selectively neutral 
when averaged over a very long period, that there is no 
dominance, and p — 0.5, V, — 0.0483. Abscissa: gene frequency 
x; Ordinate: probability density $; ¢ is time in generations. 
(From Kimura, 1954.) 


Taking the logarithm of both sides and putting 


x? 
Zz = los). 8.7.13 


equation 8.7.12 becomes 
Z141 = Z, — logl — s), 8.7.14 


where z,, as given by 8.7.13, is the logit of x“ and changes continuously 
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from —oo to +œ as x) changes from 0 to 1. The above relation 8.7.14 
leads to 


t 
2141 = zo — Y log,(1 — sj), 8.7.15 
i-0 


where s; is the value of s at the ith generation. 

If we assume that —log,(1 — 5) is a random variable which fluctuates 
from generation to generation with mean O and variance c?, and that its 
values in different generations are independent, then by the central limit 
theorem, the distribution of z, approaches a normal distribution with mean 
Zo and variance c?t as t gets large (cf. Dempster, 1955a). This means that 
in a finite length of time complete fixation or loss of the gene will never be 
realized. More generally, if —log,(1 — s) has mean m and variance c? per 
generation, the distribution of z, approaches a normal distribution with mean 
Zo + fit and variance c?t. 


8.8 Probability of Fixation of Mutant Genes 


8.8.1 Introductory Remarks 


The success or failure of a mutant gene in a population depends not only on 
selection but also on chance. This is because chance is involved in segregation 
at meiosis and in the number of offspring that each individual leaves to the 
next generation. Even for an advantageous gene, natural selection cannot 
insure its future survival, until individuals carrying that gene become suf- 
ficiently numerous in the population. 

It is important to know the probability of the ultimate success (i.e., 
fixation) of mutant genes, because the fixation of advantageous genes in the 
population is a key factor in the evolution of the species. 

The first significant contribution to this problem was made by Haldane 
(1927), based on a method suggested by Fisher (1922). This was followed by 
more detailed study by Fisher (1930, 1930e). They made use of the method 
which is now standard in the treatment of a stochastic process called the 
branching process. Equivalent results have been obtained by Wright (1931) 
from the study of distribution curves. The probability was also estimated 
for a recessive gene by Haldane (1927) and Wright (1942). Subsequently, 
more general results based on the continuous model (diffusion model) and 
including the case of any arbitrary degree of dominance were obtained by 
Kimura (1957). The probability of eventual fixation, u(p), was expressed in 
terms of the initial frequency p, the selection coefficients, and the effective 
population number. This function was used by Robertson (1960a) in his 
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theory of selection limits in plant and animal breeding. A still more general, 
but quite simple, expression for u(p) was obtained by Kimura (1962) in terms 
of the initial frequency and the mean and variance of the rate of change of 
gene frequency per generation. These results were obtained by using the 
Kolmogorov backward equation. 

Before discussing these more recent studies, we will summarize the 
branching-process method together with some of the main results obtained 
by Fisher and Haldane. 


8.8.2 Discrete Treatment Based on 
the Branching-process Method 


In this treatment, we assume a population consisting of an infinite but 
" countable" number of individuals and consider the absolute number of 
mutant genes in the population, rather than their relative frequency. Though 
the applicability of this method is restricted to simpler genetic situations 
(mainly genic selection and low initial frequency), it has the advantage of 
being somewhat more exact than the continuous treatment in the region 
where the frequency of mutant genes is very low. In a sense, this branching- 
process treatment is complementary to the continuous treatment which we 
will present in the next subsection in order to treat much more general cases. 
However, as we will see later, the results obtained by the former method for 
simple cases suggest that the continuous treatment may be quite satisfactory 
down to the extreme situation where a single mutant gene appears in a 
population. 

In the present section, the classical results obtained by Fisher and 
Haldane will be summarized. Suppose a mutant gene has appeared in a 
population. Let po, pi, P2, ... be the probabilities that the mutant gene will 
become 0, 1, 2, ... in number in the next generation, i.e., it will be represented 
by 0, 1, 2, ... descendants ($ (20 p, = 1). In particular, po is the probability 
that it will be lost by the next generation. Assume that 0 < p, < l. 

Instead of working directly on the probabilities, we will make use of the 
probability-generating function defined by 


oo 
f(x) = po + pix t pax? o = È P xt 8.8.2.1 
where the probability of leaving k mutant genes for the next generation is 


given as the coefficient of x*. 
The mean of this distribution is given by 


M, —f'(1) z? kp, - 8.8.2.2 
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Likewise the variance is given by 
Vy-f"0 +f) — (My. 8.8.2.3 


In the next generation, if individuals having the mutant genes reproduce 
independently so that each mutant gene again leaves 0, 1, 2, ... descendant 
genes with probabilities po, Pı, P2, ..., as in the previous generation, then 
it is not difficult to show that the probability-generating function for the 
number of mutant genes after two generations is given by 


SU) = Y pO). 8.8.24 


This is readily verified by expanding the expression and collecting the terms 
that are coefficients of the same power of x. The coefficient of x? is the prob- 
ability of no descendant now of the mutant gene two generations back, the 
coefficient of x! the probability of one, the coefficient of x? the probability of 
two, and so on. 

If this pattern of reproduction continues, the probability-generating 
function for the number of mutant genes after t generations (t = 1, 2, ...) is 
given by the tth iteration of the generating function f(x), namely, 


{IE SG), 


t times 


which may be denoted by f(x). In particular, f(x) = f(x). 
Thus 


f) 9 fU 69) = fiU Q)), 8.8.2.5 


(t = 2). The mean number of the mutant genes after t generations may be 
calculated from 


M, = fil) = fi- (SODS) = Mie x Mi, 
noting f(1) = 1. Thus 
M,-M!, 8.6.2.6 


as might be expected. 

In connection with the fate of mutant genes, the important quantity is 
the probability that the mutant gene will be lost by the tth generation. This 
is given by /,(0), which we will denote by v,. 

From 8.8.2.5, we have 


S(t) = via. 8.8.2.7 


Thus it may be seen that the probability of ultimate loss which is denoted 
by v and which is equal to v,, satisfies the equation 


f(v) =v. 8.8.2.8 
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Furthermore, it can be shown (Feller, 1950; Harris, 1963) that the prob- 
ability v is the smallest positive root of the above equation. From this, it is 
possible to prove that if 


M, —f'(1) <l, 


then v,, = l. That is, unless each mutant gene leaves on the average more 
than one descendant per generation, it is certain that the gene will ultimately 
be lost from the population. 

In order to get more detailed information on the process of elimination 
as well as the probability of survival for advantageous mutant genes, it is 
necessary to assume a particular form of distribution for the number of 
descendant genes. A simple but realistic assumption is that the values po, 
Pi» P2,--+ follow Poisson distribution such that 

£s 

Pk = ki e 5 8.8.2.9 
where c is the average number of descendant genes which a mutant gene 
leaves in the next generation. The probability-generating function 8.8.2.1 for 
this distribution is 


f(x) Sere, 8.8.2.10 
Let us put 
s = log. c; 


then s represents the selective advantage of the mutant gene if the population 
number is constant: The gene is disadvantageous if s < 0, neutral if s = 0, 
and advantageous if s > 0. For small absolute value of s, we have approxi- 
mately 


s=c-—l. 


In Table 8.8.2.1, the process of elimination is tabulated for some cases of 
disadvantageous genes, c = 0.90, 0.95, 0.97, 0.99, and also for a neutral gene, 
c = 1.00. In the latter case, the probability of survival is roughly equal to 
twice the reciprocal of the number of generations elapsed. It will be noted 
from the table that the process of elimination is quite rapid for c = 0.90. 

For advantageous genes (c > 1), the probability of ultimate survival, u, 
may be obtained by solving the equation 


l= u=", 8.8.2.11 


which is derived from 8.8.2.7, 8.8.2.10, and the definition 


u=l—v=l—vo- 8.8.2.12 
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Table 8.8.2.1. Probability of survival for deleterious and neutral genes, calculated 
from u, = 1 — e *"-!, uo = 1.0000. 


GENERATION c — 0.90 c — 0.95 c — 0.97 c — 0.99 c — 1.00 


0 1.0000 1.0000 1.0000 1.0000 1.0000 

1 0.5934 0.6132 0.6209 0.6284 0.6321 

5 0.1972 0.2319 0.2462 0.2607 0.2680 
10 0.0849 0.1186 0.1338 0.1499 0.1582 
20 0.0236 0.0487 0.0625 0.0787 0.0875 
50 0.0009 0.0076 0.0155 0.0287 0.0376 
100 —— —— 0.0028 0.0111 0.0193 
200 —— —— —— 0.0030 0.0098 
360 —— —— —— 0.0010 0.0065 
400 —— — — —— 0.0049 
500 —— —— — —— 0.0039 
oo 0.0000 0.0000 0.0000 0.0000 0.0000 


From 8.8.2.1], we obtain 


5 7 131 
u-25—.s^4-s? * 


3 9 — 540 $ B.8.2.13 


Thus, if s is small (s > 0), we have approximately 
u = 2s. 8.8.2.14 


That is, the probability of ultimate survival of an individual mutant gene is 
approximately equal to twice its selective advantage, if the advantage is 
small (Haldane, 1927). Table 8.8.2.2 shows for a few examples (c = 1.01, 1.03, 
1.05, 1.10, and 1.50) how the probability of survival changes with time. It 
may be seen from the table that for c — 1.50 the final value is approached 
very rapidly. If the gene survives the first five generations it is here to stay. 

The above treatment based on the branching-process method, though 
simple and intuitively appealing, has the drawback that it cannot be used 
to calculate the probability of fixation in a small population for genes with 
an arbitrary degree of dominance. The diffusion-equation method, as will 
be discussed below, can overcome this difficulty and enables us to obtain 
general but simple answers to many important problems in gene fixation, 
sometimes with remarkable accuracy despite the fact that it is essentially an 
approximation. It makes use of the Kolmogorov backward equation, first 
used in this context by Kimura (1957, 1962). 
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Table 8.8.2.2. Probability of survival for advantageous mutant genes, as calculated 
from u, —1 — e *"-!, starting with uo = 1.00000. 


GENERATION c — 1.01 = 1.03 c — 1.05 c — 1.10 c — 1.50 
0 1.00000 1.00000 1.00000 1.00000 1.00000 

] 0.63578 0.64299 0.65006 0.66712 0.77686 

5 0.27541 0.29015 0.30497 0.34212 0.60502 
10 0.16674 0.18424 0.20232 0.24928 0.58486 
20 0.09694 0.11707 0.13879 0.19763 0.58283 
50 0.04800 0.07266 0.10107 0.17713 0.58281 
100 0.03069 0.06047 0.09427 0.17613 0.58281 
200 0.02272 0.05781 0.09370 0.17613 0.58281 
300 0.02074 0.05768 0.09370 0.17613 0.58281 
400 0.02009 0.05768 0.09370 0.17613 0.58281 
500 0.01986 0.05768 0.09370 0.17613 0.58281 
cO 0.01973 0.05768 0.09370 0.17613 0.58281 


8.8.3 Continuous Treatment Based on 
the Kolmogorov Backward Equation 


In this section, we will treat the problem with much wider scope. Instead of 
restricting our attention to the probability of fixation of individual mutant 
genes, we will consider more generally the probability, u(p, t), that the mutant 
gene becomes fixed in the population by the tth generation, given that its 
frequency is p at t — O. Here it will be assumed that the population number is 
large enough so that the process of change in gene frequency is treated as a 
continuous stochastic process with good approximation. 

Then, as shown in Section 8.3, the Kolmogorov backward equation is 
applicable to the problem and we have (see 8.3.29) 


Ou(p, t) 
p ap , 





Ou(p.r) — Va, Ü^uCp, t) 


at 2 Op! + M; 8.8.3.1 


where Msp and Vs, are the mean and the variance in the change of gene 
frequency per generation, the gene frequency p being considered as variable. 

The required probability, u( p, t), may be obtained by solving the above 
partial differential equation with boundary conditions 


u(0, t) = 0, u(1l, t) — l. 8.8.3.2 
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In the simplest case of random drift in a finite population of effective size N, 
with no mutation and selection involved, we have 


F1 — p) 


Vor 2N, ' 


Map = 0, 


and 8.8.3.1 reduces to 


= 2 
Ou(p, t) _ p(l — p) u(p, D 8.8.3.3 
at 4N, | Op 





The pertinent solution of this equation is 


u(p, t) = p + Y Qi + Dp(1 — pX— 1) 
xs 8.8.3.4 
i(i * 1) 


x F(1— i, i +2,2, ple “ane, 





which agrees exactly with the result obtained as 8.4.12 by a different method. 
Let us now consider the ultimate probability of fixation defined by 


u(p) = lim u(p, t), 8.8.3.5 


t-* oo 


for which @u/dt = 0 and which therefore satisfies the equation 


Vsp d'u(p) du(p) _ 
EX dp? + Mop ap = 0, 8.8.3.6 








with boundary conditions 
u(0) = 0, u(1) = 1. 8.8.3.7 
The above differential equation may be written in the form 


a (uo 99)... 
dp dp] | Va’ 





P 


and it may be seen that the solution which satisfies the boundary conditions 
8.8.3.7 is given by 





[669 dx 
u(p) ==> 8.8.3.8 
I : G(x) dx 
where 
2Max x 
G(x) =e * Vox f 8.8.3.9 
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in which Ms, and V;, are the mean and the variance of the amount of change 
in gene frequency x per generation (Kimura, 1962). Formula 8.8.3.8 gives 
the (ultimate) probability of fixation of a mutant gene whose initial frequency 
is p. It has a pleasing simplicity and generality comparable to Wright's 
formula for the probability distribution of gene frequency at equilibrium 
(see Section 9.1 in the next chapter). His formula may be expressed in the 
present notation as 


(x) 


C 
=———__., 8.8.3.10 
Va G(x) 


where C is a constant determined such that 


[ 40 dx — 1. 
0 


It may be remarked that the above formula for u(p) is the steady-state 
solution of the Kolmogorov backward equation and is the counterpart of 
Wright’s formula for $(x), which is the steady-state solution of the forward 
equation. 

In a population consisting of N individuals in each generation (N not 
necessarily equal to N,), the chance of ultimate fixation of an individual 
mutant gene is given by u(p) with p = 1/2N; i.e., 


u — u(1/2N). 8.8.3.11 


However, if the population consists of different numbers of males (N*) and 
females (N**), a slight modification taking p =1/(4N*) or p= I/(AN**) 
may be required, depending on whether the mutant gene occurred in a male 
or in a female, as pointed out by Moran (1961) and Watterson (1962). Sub- 
sequently we will investigate several more special cases by applying these 
results. We will consider a population of sexually reproducing diploid indi- 
viduals and denote the mutant gene by A, assuming its initial frequency p. 

Let us start from the simplest case of genic selection in which A has a 
constant selective advantage s over its alleles in a population of effective 
size N,. If the frequency of A is x, then the mean and the variance in the rate 
of change in x per generation are 


Mg, = sx(1 — x) 
1- l 8.8.3.12 
pe 
2N, 
so that 2M,,/V;, = 4N. s, G(x) = e *"***, and we obtain from 8.8.3.8 
1 A= e tNesp 


u(p) = i-e e 8.8.3.13 
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This formula is a special case of formula 8.8.3.21 which we will derive later. 
A corresponding result has been obtained by Moran (1962, p. 118) for the 
haploid case. For |2N,s| < n, the right side of the above equation may be 
expanded in terms of 4N, s as follows: 


E -1)7! 
y 9X ) 


t 


u(p) = (AN, 5)" 


oo (2N y 8.8.3.14 
e? 
= p+ 2N,sp(l — p) * dea UD Iter, 


where ¢,(p)’s are Bernoulli polynomials. 
The probability of fixation of an individual mutant gene is obtained 
from 8.8.3.13 by putting p = 1/(2N). 


u = ( - exa — e 4") 8.8.3.15 


For a random-mating population with Poisson distribution of progeny 
number, N, = N and the above formula reduces to 
]-e -2s 


-ANs* 8.8.3.16 


y= 
1—e 


If |s| is small, we obtain 


25 
u= IIN 8.8.3.17 


l-e 
as a good approximation. This agrees with the results obtained by Fisher 
(1930) and Wright (1931). This formula is good even for negative s, though u 
for such a case is very small unless |Ns| is not large. For a positive s and very 
large N we obtain the result, already worked out in the previous section, that 
the probability of ultimate survival of an advantageous mutant gene is 
approximately twice the selection coefficient (see 8.8.2.14), that is, 


uz 2s. 8.8.3.18 


If N, differs from N, the corresponding formula derived from 8.8.3.15 is 
N, 
ux 2s( =e) ; 8.8.3.19 


In other words, the classical result u ~ 2s should be modified by a factor of 
N./N (Kimura, 1964). According to Crow and Morton (1955), estimated 
values of N,/N are 0.71 ~ 0.9 for Drosophila, 0.75 for Lymnaea, and 0.69 ~ 
0.95 for man. In natural as well as controlled populations, the effective size 
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N, will usually be different from the actual size N. Therefore, formula 8.8.3.19 
should be more useful than 8.8.3.18. 

On the other hand, if we let s —O in 8.8.3.15, we obtain u = 1/(2N) 
irrespective of the effective size N,. This is the special case of u — p, the 
result known for a neutral gene. 

We will now consider a more general case of zygotic selection. Let s 
and s/ be the selective advantage of the mutant homozygote and the hetero- 
zygote respectively. Then 


Ms. = sx(1 — x)[^ + (1 — 2h)x] 


x(1 — x) 8.8.3.20 
2N, ' 





Vex = 


and therefore 
_ 4-2cDx(1-x)-2 
G(x) =e cDx( x) i 


where c= N,s and D = 2h — 1. Thus we obtain 


[: e ^x —x)-2cx dx 
u (eee 
(p) n Uus. 8.8.3.21 


(Kimura, 1957). The rate of approach to the ultimate state of complete 
fixation or loss is given by the smallest eigenvalue A, of the equation 


Qu  p(1— p) d7u 
zl Wwe 1 — p){h + (1 — 2h)p} — 8.8.3.22 
a ANS dpi + sp(1— p){h + ( » 


In this case A, is equal to the smallest eigenvalue of the adjoint form 8.6.3.1 
and is given by 8.6.3.2 with c = N,s and D=2h — 1. 


For a completely recessive gene // = 0 or D = —1, and we have 
joe 2cx! dx 
u(p) — s 8.8.3.23 


TERE 
Jae 2cx1 dx 


If s is positive and small but N, s is large, the above formula leads approxi- 
mately to 


UM 2s 8.8.3.24 
nN 


for N, =N and p 2 1/(2N). This gives the probability of fixation of an 
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individual mutant gene which is advantageous but completely recessive in a 
randomly mating population consisting of N individuals each generation and 
distribution of progeny number following the Poisson distribution. It is 
interesting to note that the more exact value given here in 8.8.3.24 lies between 


SIN, the value estimated by Haldane (1927) in his treatment of this as a 
branching process, and ./s/2N, obtained by Wright (1942) with his method 


of integral equations. Also, Wright's numerical approximation 1.1 4/5/2N 
is very close to the present value. 

From 8.8.3.21, it is also possible to calculate the chance of fixation of a 
nearly recessive gene with a selective advantage in the homozygous state 
(Kimura, 1957). For example, consider a case with N,— N = 10? and 
s = 107*. If the mutant gene is completely recessive (A = 0), u + 0.8 x 102. 
With a slight phenotypic effect of h = 0.01 inthe heterozygote, u ~ 0.9 x 1077, 
while with A = 0.1, ux 2.3 x 107°. 

The function u(p) defined in 8.8.3.5 plays a very important role in the 
theory of limits in artificial selection put forward by Robertson (19602), who 
refers to u(p) as “ the expected limit," since if we assign value | to the pheno- 
type of the mutant homozygote, u(p) is the expected value with respect to 
this locus at the limit. Thus the total advance by selection is u(p) — p. If 
N,s is small in absolute value, we can expand u(p) in terms of N,s. For the 
case of no dominance as shown in 8.8.3.14, we obtain 


u(p) — p 2 2N,sp(1— p) t---. 


We note here that sp (1 — p) is the change of p in one generation by selection. 
This means that, for a character governed by additive genes, the total advance 
is 2N, times the change in the first generation, provided that N,s per locus 
is small. Similarly it has been shown by Robertson (19600) that, in the case 
of recessive character, the ratio of expected total response to initial change 
in the first generation is 2N, times (1 4- p)/3p. 

In most of animal- and plant-breeding theory, such as we have discussed 
earlier in the book, the major aim has been to maximize the rate of progress 
under selection. The selection limit theory inquires into the other aspect of 
the problem and asks such questions as “ Under what selection program is 
the highest ultimate performance attained?" In general the two questions do 
not have the same answer. 

The Kolmogorov backward equation can be extended to treat the prob- 
ability of joint fixation of mutant genes involving two or more loci. As an 
example, we will consider a case of two independent loci each with a pair 
of alleles, A, and A;, in the first locus, and B, and B, in the second. Let 
u(p, q; t) be the probability that both A, and B, become fixed in the popu- 
lation by the tth generation, given that their initial frequencies (at t = 0) are 
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p and q respectively. Then u(p, q; t) satisfies the following partial differential 
equation: 


0Ou(p,q;t)) 1. O0?u(pq;t) 1. O'u(p,q;t) 
LU d n tuc e ccc Lut c qui cC A 
ðr 297 ap 2 "^ ig? 
8.8.3.25 
du(P, q; t) Ou(p, q; t) 


The above equation is an extension of equation 8.8.3.1 (still assuming that 
the process is time homogeneous) and is the counterpart of equation 8.3.24. 
The probability of ultimate joint fixation defined by 


u(p, q) =lim u(p, q; t) 
t>o 
satisfies the partial differential equation 


1., Ou(pq) 1 , O'u(p, q) 
2a + en ag + Ms =0 





du(P, q) Ou(p, q) 
s~ 4M, 2: 


8.8.3.26 


and the boundary conditions that both u(p, 1) and w(1, q) reduce to the 
appropriate formulae for the probability of fixation at a single locus. 

In the simplest case of a haploid population with effective size N,, if 
the fitnesses of four genotypes are given as in Table 8.8.3.1, then equation 
8.8.3.26 becomes 


p(l — p)d*u a(1— q) 3u 
2N, Op? 2N, ôg? 
8.8.3.27 


Ou Ou 
+ p(1 — pY(s, + eg) — + 4(1 — a) (s; + ep) — = 0. 
Op ôq 


Table 8.8.3.1. Table of fitnesses of four genotypes measured in Malthusian 
parameters, taking the fitness of 4; B; as standard. 





FITNESS 
GENOTYPE (MALTHUSIAN PARAMETERS) FREQUENCY 
AiB, Sı t $2 +E Pq 
A,B 5i p — q) 
A2 B, $2 (1 LT pa 
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The general solution of the above equation has not been obtained; equation 
10.17 given in Kimura (1964, p. 228) is unfortunately erroneous. However, 
for the special case of s, = s, = 0, the pertinent solution 


] ep 2N,tpg 


u(p, 4) = — Hu 8.8.3.28 


was obtained by Kimura and it was checked through simulation experiments 
by Ohta (1968). Also, if |N,s,|, |N, 52], and |N,«| are all small, the approxi- 
mate solution of 8.8.3.27 is 


u(p, q) = pq(1 + N,5,( — p) + N,5(1 — q) + N,e(1 — pq) T je 
8.8.3.29 


Equation 8.8.3.28 may be used to study the effect of epistasis on the 
probability of joint fixation when mutant genes are individually neutral but 
become advantageous in combination. 


8.9 The Average Number of Generations Until Fixation 
of a Mutant Gene in a Finite Population 


Once the probability of fixation is known, it is natural to ask how long 
it takes for a mutant gene to reach fixation, excluding the cases in which it is 
eventually lost from the population. The basic theory to answer this question 
has been worked out by Kimura and Ohta (1969). They have shown that a 
single mutant gene, if it is selectively neutral, takes about 4N, generations 
on the average to reach fixation in a population of effective size N,. More 
generally, if p is the initial frequency of the mutant allele, the average number 
of generations until fixation (excluding the cases of loss) is given by 


1 
Nip) = [wu = WO} dé + TAP luu de, a 
where u(p) is the probability of ultimate fixation (see equation 8.8.3.8) and 
, [o6 ax G(x) dx 
NS ODD 


in which G(x) is given by 8.8.3.9 in the previous section. Similarly, the average 
number of generations until loss (excluding the cases of ultimate fixation) is 


(p) = D. a WEL — «(Y dé + [ WEL — ufu) dE. 8.9.2 
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In the simplest case of neutral mutations, formulae 8.9.1 and 8.9.2 reduce 
respectively to 


AO - {4N,(1 — p)log,(1 — p)} es 
and 


P 
=P 





io) = -aN (tos. r. - 


If the mutant allele is represented only once at the moment of its occurrence 
in a population of actual size N and effective size N,, we may set p = 1/(2N) 








UM 4 T T T E 1T t1 1I 
0 01 02 03 04 05 06 07 OB 09 10 
D 


Figure 8.9.1. The average number of generations until 
fixation, £,(p), as a function of initia! gene frequency, 
p. It is assumed that the gene is selectively neutral. In 
this figure, the theoretical values based on formula 
8.9.3 are represented by curves and those of Monte 
Carlo experiments by square dots. 2N. = 20 in the 
upper curve and 2N, = 10 in the lower one. (From 
Kimura and Ohta, 1969.) 
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in the above formulae. Thus, for a neutral mutation, we have 


: 1 1 
H/2N) = — svn, (1 - =i) HB = m] ~ AN, 8.9.5 


and 





4N N 
to(1/2N) = 2. log QN) = (5s tos. oo. 8.9.6 


For example, if N = 10* and N, = 0.8 x 10*, #, = 32000 and i, = 15.9. This 
shows that in general a great majority, 1 — 1/(2N), of neutral mutant genes 
which appeared in a finite population are lost from the population within a 
small number of generations, while the remaining minority, 1/(2N), spread 
over the entire population to reach fixation, taking a very large number of 
generations. For details, readers may refer to Kimura and Ohta (1969). 
Figure 8.9.1 shows some results of Monte Carlo experiments to test equation 
8.9.3. 

We may add here that for the special case of selectively neutral mutants, 
not only the mean value of the length of time until fixation, but also its 
standard deviation has been obtained. It was shown by Narain (1969) and also 
by Kimura and Ohta (1969a) that the standard deviation of the length of time 
until fixation of a neutral mutant is about (2.15)N, generations. Furthermore, 
for this case, the entire probability distribution of the length of time until 
fixation has been worked out (Kimura, 1969a). 


DISTRIBUTION 
OF 

GENE 
PRISE 
POPULATIONS 








n earlier chapters we have emphasized, as Haldane first did, that selection 

and evolution have both static and dynamic aspects. Chapter 5 treated 

those factors, principally selection, that change the population to 
produce evolutionary transformations—the dynamics of evolution. In 
Chapter 6 we considered the approximate balance between various opposing 
forces that keeps the gene frequencies somewhere near an equilibrium— 
the statics of evolution. 

Chapters 8 and 9 are analogous, but with the added complication that 
random factors are taken into account. Chapter 8 has been a stochastic 
treatment of the dynamics of evolution. This final chapter considers the 
interaction of the various factors, both deterministic and random, that lead 
to equilibria. However, we should expect to find instead of a single point of 
stable equilibrium a probability distribution. We will investigate the equilib- 
rium distribution of gene frequencies that is attained under the joint action 
of systematic factors that tend to bring the population to a fixed value and 
random factors that cause random scatter therefrom. 


433 
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The first attempt to deduce the gene frequency distribution was by 
Fisher (1922). This was clarified and developed in his 1930 book. In his 1922 
paper, Fisher compared such an investigation to the analytical treatment of 
the Theory of Gases. Starting in 1931, Wright has published a large series 
of important papers in which the steady-state distribution has been worked 
out under more and more general assumptions. Much of our knowledge 
of this subject, a grossly disproportionate share, is based on the work of 
these two pioneers. 


9.1 Wright's Formula for the Gene Frequency Distribution 


In Chapter 8, we studied the change of gene frequencies as a stochastic 
process. For a pair of alleles at a single locus we designated by @(p, x; t) 
the density of the transition probability that the gene frequency becomes x 
at the tth generation, given that it is p at t = O. In the following material we 
will designate by $(x) the density function of the steady-state distribution 
such that 


limó(p, x; t) = $(x). 9.1.1 


That is, the distribution is independent of the initial frequency p. For example, 
we have shown (8.5.10) that under linear pressure and random sampling 
of gametes we get 


F(ANm) 


P(x) = O(P, x; ©) = rc X DEAN m xp) 9.1.2 


x xiNIEEE TE. t. xp MM ae Ze 


which is independent of p. This independence may not be difficult to under- 
stand if we note that formula 9.1.2 represents the distribution at the state 
of balance between the loss of genes by random drift and their new production 
or introduction by mutation or migration. 

We will sometimes speak of ¢(x) as the frequency distribution since 
(x) dx gives the relative number of populations with gene frequency in the 
range of x ~ x + dx among the hypothetical aggregate of an infinite number 
of populations satisfying the same conditions. In the case of a single locus 
with a pair of alleles, a quite general formula for $(x) has been obtained by 
Wright (1938a). His formula may be expressed as follows: 





E C M, 
$(x) = V. exp(2 f V, ax), 9.1.3 
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where M,, and V,, are respectively the mean and variance of the rate of 
change in x per generation. C is a constant which is usually adjusted so that 


[ 669 dx = l; 9.1.4 
0 


that is, the area under the curve is unity. Wright's formula (9.1.3) may also 
be expressed as 


C 
g(x) = V, G(x)’ 9.1.5 
where 
-(2Mex 
G(x)=e ` “sx gs 9.1.6 


It is remarkable that Wright (1938) derived the formula from a simple 
consideration that at equilibrium the mean and variance are unchanged 
between successive generations. Later he showed that all the moments are 
indeed unchanged for this formula (Wright, 1952a). 

From our standpoint, however, it is more natural to derive this formula 
from 8.3.17 by imposing the condition P(x, t) = 0, that is, at equilibrium the 
net probability flux is 0 at every point in the interval (0, 1). This leads to 


] d 
5 dy (5 9) — Mio = 0. 9.1.7 


Let V4, =f; then 








ldf 2M, 
fdx Vy 

Or 
d 2M, 
dx es 7 Vc 


Integrating both sides of the last expression, we get 


2M 
bx dx 
Vsx 





log f = const + Í 


or 





M 
f=C exp(2 | V. ax), 9.1.8 
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which is equivalent to 9.1.3 if we note that @ = f/V,,. One of the tacit assump- 
tions involved here is that the process is time homogeneous so that M,, and 
V,, are independent of t. 

The equilibrium distribution discussed above may be called the flux-zero 
distribution, because the net probability flux is 0, i.e., P(x, t) = 0. 

On the other hand, there is an equilibrium distribution among mutant 
genes at different loci resulting from steady flux of mutations. In this case, a 
steady state is reached with respect to distribution of intermediate gene 
frequencies (0 < x < 1) but there is a constant probability flux from one 
terminal class to the other, i.e., P(x, t) 2 constant. We will treat some prob- 
lems relating to such a distribution later in this chapter (see Section 9.8). 

In the next few sections we will investigate, using Wright's formula 
(9.1.3), some concrete forms of the equilibrium distribution, assuming various 
conditions of mutation, migration, selection, and random sampling of 
gametes. 


9.2 Distribution of Gene Frequencies Among 
Subgroups Under Linear Pressure 


Consider a species subdivided into partially isolated groups, each of which 
exchanges individuals with the general population at a constant rate. Let x 
be the frequency of allele A, in a subgroup which exchanges individuals at 
the rate m per generation. If x, is the frequency of A, among the immigrants, 
the rate of change of x per generation is 


óx = m(x, — x). 9.2.1 


We will assume as we did in Chapter 6 that the immigrants representarandom 
sample from the entire species so that x, is constant and equal to the general 
mean X (see 9.2.6 below). Such a model was called by Wright (1951) the 
island model. We disregard selection, but the effect of mutation can be 
included in the parameter m (see 8.5). From 9.2.1, the mean of óx is 


M,, = m(x; — x), 
which may also be expressed as 
M,, = —m(l — xpx + mx,(1 — x). 9.2.2 


If N, is the effective size of the subgroup, the variance in óx due to random 
sampling of gametes is 


_ x(1 — x) 


V, 
bx 2 N, 


9.2.3 
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Thus 





ife dx = -AN,m(1 — xp) fh t AN, mx, |: 
= 4N,m((L — xr) log(1 — x) + x, log x} + const, 
and from 9.1.3 we obtain 
P(x) = Cx* a — a 7x71, 9.2.4 


where the constant C is determined by the condition 9.1.4: 
1 1 
| 469 dx =c | xtti o ymo dy 
0 o 
= C x BAN mx, , AN, m(1 — x)= 1, 
in which B(-, +) represents the beta function. Thus we get 


es 1 B T(4N, m) 
B(4N, mxr, AN m(1 — x)) | I(AN,mxpT(4N, m(1 — x;)) 


and 9.2.4 becomes 


dcs T(4N,m) 
ue T(4N, mxpT (AN, m(1 — xj)) 9.2.5 


x iia de, a, x) Nem ED 


where I'(*) represents the gamma function. Note that the above formula 
agrees with 9.1.2. Figure 9.2.1 illustrates the distribution curve y = $(x) for 
various values of N,m, assuming x, — 0.5. In this figure, the abscissa rep- 
resents the frequency of A, in a subgroup and the ordinate represents a 
corresponding probability density. As is shown in the figure, the distribution 
curve changes from U-shaped to bell-shaped as 2N,m changes from smaller 
QN,m < 1) to larger (2N,m > 1) values, passing through a flat shape at the 
intermediate value of 2N,m = 1. If 2N,m is less than unity, the curve is 
U-shaped and there is a considerable tendency for gene A, to be either fixed 
(x = 1) or lost (x = 0) within a subgroup. This tendency is larger, the smaller 
the value of 2N, m. On the other hand, if 2N, m is larger than unity, the curve 
is bell-shaped and the gene frequency within a subgroup tends to approach 
the general mean (xj), the tendency naturally being stronger the larger the 
value of 2N,m. Since 2N,m = ] means an exchange of one individual in 
every two generations, this means that the isolation has to be fairly strong 
between subgroups for the tendency of random fixation or loss to be 


predominant. 
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Figure 9.2.1. Distribution of gene frequencies 
among subgroups for various values of N. m. 
Here the gene frequency in immigrants is assumed 
to be 0.5 (x, — 0.5). Abscissa: gene frequency. 
Ordinate: probability density. (From Wright, 
1940.) 


However, in an actual situation, exchange of individuals may be re- 
stricted mainly to adjacent subgroups so that m in 9.2.5 should be replaced 
by m(1 — r), where r is the correlation coefficient of gene frequencies between 
adjacent subgroups. Since r can be very near to unity in such a circumstance, 
a considerable amount of exchange would still lead to appreciable fixation 
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or loss of an allele. Actually, a model of population structure in which the 
entire population is subdivided into colonies and migration in each generation 
is restricted to adjacent colonies (‘‘stepping-stone model") was worked out 
by Kimura and Weiss (1964) and Weiss and Kimura (1965). They determined 
the correlation coefficient of gene frequencies between two arbitrary colonies 
for one-, two-, three-, and higher dimensional stepping-stone models (see 
Section 9.9). 

For the distribution given in 9.2.5, the mean gene frequency over the 
whole species is 


X= [ xéCo dx = xj. 9.2.6 
0 


This agrees with the assumption that in the island model migrants represent 
a random sample from the whole species. The variance of gene frequency 
among subgroups is 


c z= (x — 3)!d(x) dx E EI 9.2.7 


as first given by Wright (1931). This was derived earlier by elementary 


methods (see 6.6.6). 
This can also be derived from the following procedure: Let x, be the 


frequency of A, in the tth generation; then 
X ep Hyp OX, 9.2.8 


where óx, has mean and variance given by 9.2.2 and 9.2.3, Using the same 
procedure we used in the study of pure random drift (compare Section 7.4), 
we get 


up d 1) = E(x, + ôx,) 
so that 44?) = x. In addition, we get 


te = E(x}, 1) = E{(x, + óx)!) 
(1 — x, 
= u$? + 2E, (x, m(X — x) + E, am x wu 
or 


— I s l 
Aus? = Imu PE — i2?) + v HA — u^). 
e 
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For t > oo, Au? — 0, j/ (9? = x, and we obtain 


X 1 


Thus 





c; = uf? — (uy = SNR = Bee) ; 9.2.10 
4N.m+1 4N.m+ 1 
which agrees with Wright’s formula (9.2.7). 

To derive the above formula for variance, we have assumed that the 
change in gene frequency by migration is small so that in calculating the 
sampling variance x(1 — x)/2N,, the gene frequency before migration may be 
used for x. On the other hand, if the amount of migration per generation is 
very large and if the sampling of gametes occurs after migration, it is more 
accurate to use the frequency after migration for x in calculating the variance. 
Thus, denoting by Ax the amount of deterministic change by migration and 
by € the amount of stochastic change by random sampling of gametes, we 
have E(£,) = 0 and 


X(1— X) 


E(&) = aN 9.2.11 


where X, = x, + Ax, — x, + mX — x) 2 mx - (1 — m)x, is the frequency 
after migration but before the sampling. Since x,,, = X, + £,, formula 
9.2.9 for uh is modified as 


X,((1— X 
uM T = E(X?) + E, | ( 2 


2N, 
9.2.12 


1 1 
ay ee 2) qc 
( AO + ON, E(X,), 


where E(X,) = mx + (1 — m)jr(? and 
E(X?) = (mx)? + 2mx(1 — m)? + (1 — my». 
At equilibrium in which 4? 2 x and uf'* P = uf? = uf? = o? + X2, we 


have E(X) 2 X and E(X2) = x? + (1 — m)?o2, and therefore 9.2.12 yields 


X 
2N 





Z Ies 
seg (1) + (1 o mene à 
or 


NE: x(1— x) 


2N, ies ny(1 21 ) 9.2.13 
2N, 
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Unless an unusually large value is assigned to m, the difference between this 
improved formula, 9.2.13, and the approximate formula, 9.2.10, is negligible. 

Assuming migration only, we may immediately adapt the above treat- 
ment to include reversible mutations. If there is mutation from 4, to its 
allele A, at the rate of u per generation and also reverse mutation from A, 
to A, at the rate v, then the rate of change in x due to these mutations is 
—ux + v(l — x). Thus 9.2.2 becomes 


Ma, = —[m(1 — xp + u]x + (mx, + v1 — x). 9.2.14 


This means that, in order to include mutations, we simply replace m by 
m + u + v and mx, by mx, + v in the various formulae. 

Going back to distribution 9.2.5, let us now investigate the frequencies 
of fixed classes or the probabilities that allele 4, is temporarily fixed or lost 
in the subgroup. 

For this, we note first that although the constant C in 9.2.4 was deter- 
mined by the condition 


[ 669 dx — 1, 9.2.15 
0 


from which 9.2.5 was derived, the study of the stochastic process involved 
(see formula 8.5.9) shows that C is intrinsically determined by the process 
and it is not a constant determined by the arbitrary statistical procedure of 
making its integral equal to unity. That is to say, condition 9.2.15 follows 
directly from the nature of the stochastic process. 

Thus it looks as if in the present treatment no probability mass is left 
to represent the fixed classes. It turns out, however (cf. Kimura, 1968), that 
the frequency f(0) that 4, is temporarily lost from the population is given by 


1 
SO) = [?" 63) dx. 2216 
0 
Thus, (AN, m) nonas 
e m e I 
LON T(4N, mx, + DF(AN,m( — xj) (=) EN 


approximately. This may also be derived by considering the balance between 
mutation and random extinction at the subterminal class, as suggested by 
Wright (1931). 

Similarly, the frequency f(1) that A, is temporarily fixed in the population 
(A, temporarily lost) is 


s= ob) dx 
172N 


I'(4N, m) ] VAN ma xp 9.2.18 
— 222/21. AL BURG M 
T(4N, mx) (AN, m(1 — xj) + 1) (zs) 
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9.3 Distribution of Gene Frequencies 
Under Selection and Reversible Mutation 


Let us assume a pair of alleles A, and A, with respective frequencies x and 
| — x in a random-mating population of effective size N,. Let u be the 
mutation rate from A, to A, and let v be the mutation rate in the reverse 
direction. Then the rate of frequency change of A, by mutation is 


—ux + v(l — x). 
If a is the average fitness of the population measured in Malthusian param- 
eters, then the rate of change in gene frequency owing to natural selection 
is given by 
x(1 — x) da 
2 dx. 





Note that if the fitness is measured in selective values (W), the corresponding 
rate of change is given by Wright's formula 
x(1— x) aW 
2W dx’ 





where W is the mean selective value of the population (cf. 5.2.12). Combining 
the expressions for mutation and selection, the mean rate of change in x per 
generation is 


x(1 — x) da 
2 dx 





Ms, = —ux + v(1 — x) + 9.3.1 


For a population of effective size N,, the variance in 6x due to random 
sampling of gametes is 
x(1 — x) 
V = ' 
óx 2N, 





Therefore 
May 
2 pa dx = 4N,u log(1 — x) + AN,v log x + 2N,à + const, 
óx 


and we obtain from 9.1.3 the distribution formula for the frequency of A,: 
g(x) = Ce rey m x)'ve 9.3.2 


In the simplest case of genic selection in which A, has selective advantage s 
over A,, we have 


@ = 2sx* + s2x(1 — x) = 2sx, 
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if we assign relative fitnesses 25, s, and O to A, 4,, 4, 4; , and A, A; respectively. 
Then 9.3.2 reduces to 


dix) = Ce ey Hale aye, 9.3.3 


In Figures 9.3.1a, b, and c the frequency distribution of a deleterious gene 


t 

' 10 
; * 40u 
1 
LI 








a 100 
40u 








(c) 


Figures 9.3.1a,b,c. Graphs showing the frequency distribution of a 
deleterious gene (s < 0) in a small (a), an intermediate (b), and a large (c) 
population, assuming equal mutation rates in both directions (u — v). 

In each figure, curves with solid, broken, and dotted lines represent cases 
with s = —u/10, —u, and — 10u, respectively. (From Wright, 1937.) 
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(s < 0) in a small population (a), i.e., Ne = 1/40u, an intermediate population 
(b) i.e., N. = 10/40u, and a large population (c), i.e., N, = 100/40u, is 
illustrated for three levels of selection intensities, assuming u — v. In each 
figure, curves with solid, broken, and dotted lines represent cases with 
S= —u[l0, —u, and —10u, respectively. As will be seen from Figure 9.3.1a, 
fixation or loss of alleles predominates in a small population and selection 
is not very effective in determining the gene frequency distribution. On the 
other hand, in a large population (c) the gene frequency tends to cluster 
around the equilibrium value and small change in selection intensity may 
lead to a marked change in the distribution. 

In the more general case of zygotic selection, if the relative fitnesses 
of three genotypes A,A,, 4,47, and A, A, are s, sh, and O, respectively, 
measured in Malthusian parameters, then 


à = sx? + 2shx(1 — x) 








Figures 9.3.2a,b,c. Graphs showing the distribution of a completely recessive 
deleterious gene (s <0, h = 0, u = v). Effective size Me = 1/40u, and 

10/40u, and 100/40u in a, b, and c, respectively. In these figures the solid 

line represents the least selection (s = — u/5), the broken line selection 10 
times as intense, and the dotted line selection 100 times as intense. (From 
Wright, 1937.) 
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and the distribution of x is given by 
(x) — Ce? Nesxà + 4Neshx(1 —X)y4Nev— I1 XN x)thea- 1 9.34 


Figures 9.3.2a, b, and c illustrate the distribution of completely recessive 
deleterious genes (s < 0, h = 0) for three different population sizes and for 
three levels of selection intensity. If there is overdominance between alleles, 
it Is more convenient to express the relative fitnesses of the three genotypes 
A,4,, A,A,, and A; A; as —5,, 0, and —s,, so that 


a = —s,x? — sy(1 — xy. 
Then the distribution formula for x becomes 

(x) ia Cem 2Nesix?~2Nes2(1 =x)? 4Nev— (1 xy ue 1 9.3.5 
Using a distribution formula equivalent to 9.3.4 or 9.3.5, Nei and Imaizumi 


(1966a) investigated the amount of genetic variation expected in a finite 
population at equilibrium. 


9.4 Distribution of Lethal Genes 


If gene A, is lethal, its distribution will be restricted to the range of very low 
frequencies and mutation from the lethal gene A, to its normal allele 4, 
may be neglected. Let x be the frequency of the lethal gene A,. Suppose that 
A, is completely recessive to its allele (43) so that the selective values of three 
genotypes are w,, — 0 and w,, = w22 = l. Then the rate of change in x by 
selection is 


x? 


dx 
The rate of change by mutation from A; to A, is 
vl — x). 


Thus, we can take 


2 


Ma, = uv(l — x) — 9.4.1 





l+x 


as the mean of 6x per generation. Using the binomial variance as before, 


UMS) 9.4.2 
x 2N, , 4. 


9.1.3 yields 


f(x) = C(1 — x2)? -x*Pe FL — x)*!, 9.4.3 


V; 
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, Ne = 10? 
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Figure 9.4.1. Probability distribution of the frequency of a lethal 
gene for various population sizes. The mutation rate (») is 
assumed to be 1075 per generation. Abscissa: Frequency of the 
lethal gene in the population. Ordinate: Probability density. 
(From Wright 1937.) 


where constant C is determined from 9.1.4 as 
C 2 
— B(2N,,2N,v) + B(2N,,2N,v + 4)’ 
where B(- , -) designates the beta function. Figure 9.4.1 shows the probability 
distribution of x among populations containing at least one lethal gene 
for various effective population sizes, N,. Since x is practically restricted to 
a very small positive value, we may substitute 


9.4.4 


(x) = Ce 2Nex?y4Nev~ I 9.4.5 
for 9.4.3. This formula can also be obtained by using 
M;, =v — x? 9.4.6 
instead of formula 9.4.1. The constant C is then 
2(2N, y^P" 
= TON, v) ’ 9.4.7 
approximately. The mean of the distribution is easily found and we obtain 
"m T(2N,v 4 4) ode 


J2N,T(2N, v) 
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At the limit of N,— oo, X becomes x v. This is the value expected in an 
infinitely large random-mating population and it can also be obtained from 


M, —D— x? = 0. 
On the other hand, if N,v tends to 0, 
X% v 27N, . 9.4.9 


It may be seen easily that the expected frequency of lethal genes is much 
lower in a small than in a large population. In Figure 9.4.2, mean lethal 
frequency in a population is plotted as a function of the effective population 
number, assuming v = 1075. The figure shows clearly that for the well-known 


relation X = Jv to be valid, the population number has to be hundreds of 
thousands, as pointed out by Robertson (1962). 





107* 





107? | 
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Figure 9.4.2. Average frequency of a completely recessive 
lethal gene in populations of various sizes where the 
mutation rate, v, is 1075. 
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If the lethal gene (A,) is not completely recessive in fitness but is slightly 
deleterious in the heterozygote, as is the case for the majority of lethal genes 
in Drosophila, we should take 


Ma =v — hx(1 —f) - (x'(1 f) + xf} wv — hx — x? 2 xf 9.4.10 


as the mean change of the lethal frequency per generation, still assuming that 
x is small. 

Here f denotes the inbreeding coefficient and we assume that it is small 
and constant. Using the approximate expression V;, = x/2N, for variance, 
the formula for the frequency distribution becomes 


(x) = Ce tNelht f)x- 2Nex? AN vu 1 9.4.11 


In this case, if the mutation rate and the heterozygous disadvantage are 
around v = 10^? and h = 1/40, respectively, as observed for the majority 
of lethal genes in Drosophila, the mean gene frequency is little influenced 
by the effective population number and is roughly equal to v/(h + f). 

More generally, if (h+ f) > Jo, the selection against lethal genes is 
primarily exerted in the heterozygous state so that 9.4.11 may be replaced 
by a simpler formula, 

d(x) = Ce SNP EI e mE 9.4.12 
where 

c LAN Ch + rne 

T(4N, v) , 


that is, the distribution is approximated by the gamma distribution with mean 
and variance 


U 
h+f 





x= 
and 
pen 
"o AN + fy 
In experimental studies of lethal genes, the directly observable quantity 
is the frequency of lethal-bearing chromosomes rather than individual lethal 
genes. So, let X, be the proportion of chromosomes carrying one or more 


lethal genes, and let Xy = 1 — X,. Then, assuming independent distribution 
of lethal genes at different loci, 


where x; is the frequency of the lethal gene at the ith locus. Thus, 


Q, = — loge Xo = È x 
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has a mean and variance 
E " U 
= x;=-eo 
Qi > i h +f 
and 
U 
2 2 
OQ, = 2,03, 7 >> 
a = LOST AN fy 
if we assume that (A +f) is the same for all relevant loci. In the above 
formulae, U = 5, v; is the lethal mutation rate per chromosome. 
As pointed out by Nei (1968a), since the frequency of lethal genes at 
each locus follows a gamma distribution, their sum Q, is again distributed 
as a gamma distribution, so that we have 


$(Q,) = Cem Net Deas a, 


where 
c AN. (o Y 
|». -T(4N,U) 
Figure 9.4.3 illustrates this frequency distribution for various population 
sizes, assuming U = 0.005, h = 0.025, and f = 0. 
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Figure 9.4.3. Frequency distribution of lethal chromosomes 
(Q,). Here it is assumed that each lethal gene has a hetero- 
zygous Selective disadvantage of 2.5% and that the lethal 
mutation rate per chromosome is 0.5%. The number beside 
each curve represents the effective population size. (From 
Nei, 1968.) 
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For more details on the distribution of lethal genes, readers may refer 
to Nei (19680). 


9.5 Effect of Random Fluctuation in Selection 
Intensity on the Distribution of Gene Frequencies 


In the preceding sections, we have considered only those cases in which the 
sole factor producing random fluctuation of gene frequencies is the random 
sampling of gametes. However, as pointed out in Section 8.7, it is probable 
that random fluctuation in selection intensity is also at work in determining 
the gene frequency distribution in natural populations. In this section, we will 
investigate this effect for the case of no dominance, that is, the heterozygote 
having fitness midway between the two homozygotes. For a more detailed 
treatment of the subject, see Kimura (1955). 

We will express the selective advantage of A, over A, by s, and assume 
that s is a random variable with mean § and variance V,. Let u and v be the 
mutation rates as defined in 9.3. Then the mean and variance of the rate of 
change in gene frequency x are given by 

Ma, = $x(1 — x) — ux + c(|! — x) 


x(l -> 5. 
Vin = Vx = xy S9. im 
Substituting these in 9.1.3, we get 
p(x) = Cxe- '(1 xine 1G, E yee "(x ed Ag) em l; 9.5.2 
where 
2N,V, + uA, — vA, 
7 EXP 
and 
IN, V, = vi, + ui, 
i A; 


(Kimura, 1955). Here 7, and 2, are given by 


Ni 
N,V, 


2 


in 
N.Y, 
er E (<0). 
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In order to compare the relative effect resulting from the two different 
random factors, i.e., the random sampling of gametes and random fluctuation 
of selection intensities, we will study in some detail the symmetrical case 
where § = 0 and u = v, With these assumptions, 9.5.2 reduces to 


(x) = C(x(1 — 7 (4, = x) (x - Ap m t 255 


If the population size is small such that 4N,u is much less than |, the genes 
are fixed most of the time and the distribution curves for unfixed classes are 
U-shaped, as shown in Figure 9.5.1a, where N, = 10?. If the variance 
of s is 0.1/2N, or 5 x 10^? the effect of the fluctuation is so small that, when 
drawn on the graph, the curve is indistinguishable from the one with no 
fluctuation. With V, = 10/2N, or 5 x 10^? the effect is still not striking; the 
frequencies of the subterminal classes merely rise by about 33% compared 
with the case of V, = 0. With V, = 100/2N, or 0.05, the subterminal classes 
become about 2.2 times as high as in the case where V, — 0. V, = 0.05 isa 
high fluctuation since ø, = JW; - 0.2236. Thus for small populations the 
effect of random fluctuation of selection is rather unimportant. 

Figure 9.5.16 shows the distribution of gene frequencies in intermediate 
populations such that 4N,u = 1. If N, =N, this means that one mutation 
appears every two generations on the average. For example, if we assume a 
mutation rate of one in one hundred thousand (10^ 5), N, should be 25,000, 
that is, 25 times as large as before. In such populations, if there is no fluctua- 
tion, there will be a flat distribution. That is, all the heterallelic classes are 
equally probable. Here the effect of the fluctuation may not be negligible. 
With V, = 10/2N, or 2 x 10^*, the distribution curve becomes U-shaped 
and the heights of the subterminal classes are about 4.9 times the heights of 
those where V, = 0. With V, = 100/2N, or 0.002, the distribution resembles 
that of a very small population (4N,u z 0), the frequency of the subterminal 
classes rising about 48 times as high as when V, — 0. 

In Figure 9.5.1c the population size is assumed to be twice as large as 
before (4N,u = 2), so that, if N = N,, one new mutation is expected per 
generation. With a mutation rate of u = 10 ?, N, should be 50,000. In this 
case it can be shown that if V, does not exceed 1/N, or 2 x 1075, the distribu- 
tion curve is unimodal. On the other hand, if V, exceeds this value, the curve 
becomes bimodal. As is shown in the figures, if V, = 10/2N or 10~4, the gene 
frequencies giving the two modes of the distribution are about 0.053 and 
0.947, respectively. With V, = 100/2N, or 10^?, the modal gene frequencies 
are about 0.5% and 99.5%, and the modal classes are about 104 times as 
frequent as the class with 50% gene frequency. Therefore in the last case the 
distribution curve looks as if it were U-shaped on the figure. However, thc 
class frequencies fall sharply outside these modes and the frequencies of 
terminal classes are 0. 
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Figures 9.5.1a,6,c. Effects of random fluctuation of selection intensity on 
the probability distribution of gene frequencies in a small (a), an 
intermediate (b), and a large (c) population. In these figures it is assumed 
that there is no dominance between a pair of alleles which are on the 
average neutral ($ — 0). Also mutation rates in both directions are equal 
(u =v). Ne stands for effective size of population and V, is the 

variance of the selection coefficient s. Abscissa: gene frequency (x). 
Ordinate: probability density. (From Kimura, 1955.) 
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9.6 Number of Neutral Alleles 
Maintained in a Finite Population 


In Section 7.2 we discussed the amount of heterozygosity and the effective 
number of neutral alleles maintained in a population of effective size N,. 
If the number of neutral alleles is potentially large enough that each mutant 
is a type not currently represented in the population, the probability of 
homozygosity at equilibrium was shown to be 


l 


Í> Nut 


9.6.1 

approximately, whether the population is monoecious or has separate sexes. 

N, is the inbreeding effective population number as defined in Section 7.6.2. 
The effective number of alleles maintained is 1/f, or 


n, — AN,u +l. 9.6.2 


If there are K alleles so that the chance of mutating to a particular one of 
them is u/(K — 1), then 


1 
4N 
ez = jJ +1 
f = ——————— 9.6.3 


K ] 
4 1 
Neu = j + 


As we said, in this treatment N, is the inbreeding effective number. 
Essentially the same formulae can be derived, using the variance effective 
number (see Kimura, 1968), which we now show. 

Assume that there are K possible allelic states 4,, 4,,..., Ag and that 
each allele mutates with a rate u/(K — 1) to one of the remaining K — 1 
alleles. The unit as far as the definition of state is concerned may be a single 
nucleotide, a codon, a cistron, or any other entity that can be regarded as a 
stable unit. Let x be the frequency of one of the alleles, say A,, and let X 
be its frequency after mutation. Then 


A= x+u (l — Kx), 








where u, = u/(K — 1). If we assume that random sampling of gametes is 
carried out after the deterministic change of gene frequency, then the change 
of gene frequency due to random sampling, which we denote by £, has mean 
and variance 


E) = 0, 


X(1— X 
B@ - 159. 
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In the above formula, N, is the “‘ variance" effective number of the popula- 

tion. Since the frequency of A, in the next generation is 
x-X-t-2u*(l-uK)x-£ 


if we denote by ui and ji; the first and second moments of the frequency 
distribution of A, at equilibrium, we have 


ji = E(x) =u, t (1 — u, K)pj, 9.6.4 
2 X(1— X 
Hy = EG") = BX 9 = zpe +S 
i ° 9.6.5 
1 , 
= 5y (u, + (L= uK} + (1 aet 28 = don 
t (1 —u,K)?p}}. 
Thus, from 9.6.4, we obtain 
T 9.6.6 
Hu K 6. 
Substituting 1/K for u, in 9.6.5, we obtain 
1 — 1)(2u, — ui 
ms + (2N, — 1)Qu, — u1K) "E 


K{2N, - QN, — 1)(1 — u,K)*}’ 
where u, = u/(K — 1). The average homozygosity or the sum of squares of 
the allelic frequencies may be expressed in terms of 5 as follows: 


K 


y= E(X x?) = Kyu, 9.6.8 
1 


in agreement with 9.6.3 when very small quantities are ignored. The effective 
number of alleles n, is equal to the reciprocal of H, and therefore n, = 1/(K u). 

If the number of allelic states is indefinitely large, K = oo so that u, = 0, 
u, K — u, and u?K = 0 in the above formulae and we obtain 


n, = 2N, — QN, — ly — uy. 9.6.9 


Note that N, here represents the variance effective number of the population 
rather than the inbreeding effective number that appears in 9.6.2. Since the 
mutation rate is usually very much smaller than unity (u < 1), 9.6.9 reduces to 


n, — AN,u +l, 9.6.10 


in good agreement with 9.6.2. 
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The problem of finding the distribution of the allelic frequencies is more 
difficult but was solved by Kimura and Crow (1964). For neutral alleles, the 
steady-state distribution has a rather simple form: 


$(x) = 4M(1 — x)? M" 1x7! 9.6.11 


(Kimura and Crow, 1964), where M = N,u. N, stands for the effective 
population number. 

In Chapter 7 (cf. Table 7.6.5.1) we showed that the two definitions of 
effective population number are equivalent if the population is neither 
increasing nor decreasing. In much of the discussion to follow, this assumption 
is made, so it is not necessary to distinguish between the two effective numbers. 

In this distribution d(x) dx gives an approximation to the expected 
number of alleles whose frequencies in the population lie within the range x 
to x + dx (0« x « 1). It is important to note here that we are considering 
a frequency distribution, within a single population, of various alleles having 
a different number of representatives, but we are not considering a probability 
distribution for any particular allele. 

Using the above distribution, f may be obtained by computing the sum 
of squares of allelic frequencies 


1 


1 
E 2 rc 
f=|x al ae TT 


which agrees with the result obtained in 9.6.1. Thus 


n, =7 = AN, l. 9.6.12 
This shows that the effective number of alleles is determined solely by the 
effective population number and is independent of the actual population 
number. 

The average (actual) number of alleles (n,) can be obtained by summing 
the expected number of alleles in each frequency class from 1/2N to 1. Thus 


1 1 
n, = J: (x) dx 2 4N. u if (1 — xytNe7 1x71 dy, ET 
2N 2N 


This shows that the average number of alleles within a population depends 
both on the effective and the actual population numbers. Therefore n, may 
be much more difficult to estimate from a sample than n, , and less informative 
as to genetic variability. 

Figure 9.6.1 illustrates the result of a Monte Carlo experiment on the 
number of neutral isoalleles in a small population consisting of 50 males and 
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50 females (N — 100), of which only 25 males and 25 females actually partici- 
pate in breeding (N. = 50). The simulation experiment was carried out 
using the IBM 7090 computer by generating pseudo-random numbers. In each 
generation, 100 male and 100 female gametes were randomly chosen from 
25 breeding males and 25 breeding females to form the next generation. 





Number of Alleles 
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o 100 200 300 400 500 600 700 800 900 1000 1100 1200 
Generation 


Figure 9.6.1. A result of Monte Carlo experiments on the number of 
neutral isoalleles. In this experiment actual population number N = 100, 
effective population number N., = 50, and mutation rate u = 0.005. The 
average number of alleles (n,) and the effective number of alleles (ne) are 
plotted by round and square dots respectively. Horizontal lines, solid and 
broken, represent corresponding theoretical values. For details, see text. 


Mutation to a new (not preexisting) allele was induced in each gamete with 
probability 0.005 prior to the formation of zygotes (u — 0.005). The initial 
population was set up such that it contained 200 different alleles. Outputs 
of both average and effective numbers of alleles were given at 50-generation 
intervals starting with generation 100, and the experiment was continued 
until generation 1200. The balance between mutation and random extinction 
of alleles was reached well before generation 100. The averages of 23 outputs 
are n, = 5.522 and f = 0.520 or n, = 1.923. These should be compared with 
the theoretical values obtained from equations 9.6.13 and 9.6.12, setting 
N = 100, N, = 50, and u = 0.005, that is, 


1 
n, Í , x^! dx = log, 200 = 5.298 
200 
and 


n, =4N,ut+1=2. 
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Thus, the agreement between the results of a Monte Carlo experiment and 
the theoretical predictions based on the diffusion approximation is satis- 


factory. For more extensive simulation experiments, readers may refer to 
Kimura (1968). 


9.7 The Number of Overdominant 
Alleles in a Finite Population 


Since Fisher's paper of 1922 it has been known that in an infinite population, 
overdominance leads unconditionally to stable polymorphism for a pair of 
alleles. For more than two alleles, however, a more delicate condition has to 
be satisfied at stable equilibrium (see 6.8). The complexity of the condition, 
however, does not vitiate the general conclusion that overdominance is a potent 
factor for maintaining selective polymorphism in an infinite population. 
As pointed out by Robertson (1962), selection for the heterozygote is a 
factor retarding fixation if the equilibrium gene frequency lies inside the 
range of 0.2-0.8, when a pair of alleles are involved (see 8.6.4). 

On the other hand, for an equilibrium gene frequency outside this range, 
there is a range of values of N,(s, + s2) for which heterozygote advantage in 
fact accelerates fixation. N, is the effective population number and s, and s; 
are selection coefficients against both homozygotes. This might suggest that 
if there are a large number of overdominant alleles in a population, they will 
be lost by random drift as readily as neutral alleles, and overdominance is 
rather ineffective in keeping a large number of them in a finite population. 

In this section we will investigate quantitatively the maximum number 
of overdominant alleles that can be maintained in a finite population. For 
this purpose, we will consider an ideal situation in which every mutation 
produces a novel allele which is different from the preexisting ones and every 
allele is heterotic with any other allele. Furthermore, we assume that all 
heterozygotes have equal fitness and all homozygotes also have equal fitness 
which is lower by s compared with that of the heterozygotes. Any asymmetry 
with respect to fitness within homozygotes or heterozygotes will reduce the 
number of alleles. Consider a random-mating population of effective size 
N,. We will designate by x the relative frequency of an allele in the population 
and let (x) dx be the expected number of alleles whose frequency is in the 
range x ~ x + dx. The relative frequency of each allele may change from 
generation to generation by mutation, selection, and random drift, but at 
statistical equilibrium there will be a stable distribution in x given by Wright's 
formula (cf. 9.1.3), 


vo ree, 


@(x) = V ax d 9.7.1 
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where C is a constant and M,, and V;, are the mean and variance of the rate 
of change in x. 

Let u be the mutation rate per gene per generation and denote by f the 
sum of squares of allelic frequencies that are contained in the population 


f= X xis 9.7.2 
i 


where x; is the frequency of the ith allele 4;. 

As in the previous section, we will denote by n, = 1/f the effective number 
of alleles maintained in the population. Since the rate of change of the fre- 
quency of a particular allele by mutation is — ux and that by selection is 


— sx(x — f), 9.7.3 
we have 
Mas, = —ux — sx(x — f). 9.7.4 


In the above expression, s(>0) is the selective disadvantage of a homozygote 
compared with a heterozygote measured in Malthusian parameters. For a 
discrete model, in which the selective values of a homozygote and a heterozy- 
gote are | — s and 1, respectively, the amount of change in x per generation 
is given by 


sx(x — f) 

l-sf ’ 

but since we are concerned with cases in which sf is much smaller than unity 

(0 < sf < 1), this also leads to the same expression as 9.7.3 if we neglect this 

small quantity in the denominator. The variance in the rate of change in x 
is given by 


9.7.5 


_ x — x) 


V, 
6x 2N, 


9.7.6 
Since we are considering a situation in which a large number of alleles are 
maintained in a population and the effective range of x is essentially restricted 
to a very low value (x < 1), we will use the approximate expression 





Var = Wi 9.7.7 
Using 9.7.4 and 9.7.7, we obtain from 9.7.1 the distribution formula 

D(x) = Cem 23607 /Y 74M» x71. 9.7.8 
where 


S= N,S, M = Nu. 9.7.9 


DISTRIBUTION OF GENE FREQUENCIES IN POPULATIONS 459 


In deriving this distribution formula, f is assumed to be a constant which 
is interpreted as the expected value of the sum of squares of allelic frequencies 
or, more simply, as the probability of allelism, the reciprocal of which is the 
effective number of alleles in a population. 

Such a treatment regarding f as constant (for a given S and M) is again 
an approximation, but as will be shown later, this turns out to be satisfactory 
for our purpose. 

Constant C in 9.7.8 is determined by the condition that the frequencies 
of all the alleles in the population add up to unity, 


X Xi =] 1, 
or 
1 
Í x(x) dx = 1. 9.7.10 
0 
Note that usually $ in Wright's formula (9.1.3) represents probability density 
of gene frequencies rather than the expected number of alleles as we use in 
this section. Thus, usually, C is determined by condition 9.1.4 rather than 


9.7.10. 
From 9.7.8 and 9.7.10, we obtain 


1 I i$ 
a=] e 23&- SP -4Mx ax. 9.7.11 
0 


This can also be expressed as 


1 «Mr a4$-x m 
m E pz: e ^dà4, 9.7.12 
where 


X 22/8 (r- =): 9.7.13 


At the equilibrium state in which random extinction is exactly balanced 
by mutational production of new alleles, we have the following condition 
at the subterminal class: 


2N =; (5) l (~) 9.7.14 
472 QN] 2N IN, vs 
—— M 
number of number of 
new mutants extinctions 


(cf. 8.3.22). This gives 


1 2 4M 
4M = Ce Gs 1) -5w. 9.7.15 
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Since u is very small compared with unity, the term (4M/2N) may be neglected 
in the actual calculation of the exponential term in the above formula. 
Furthermore, we assume that the effective number of alleles is much smaller 
than the total number of genes in a population, i.e., n, = 1/f < 2N. Then the 
above relation 9.7.15 may be reduced to a simpler form, 


4M = Ce 757, 9.7.16 
Combining 9.7.12 with 9.7.16 to eliminate C, we obtain 


x2 24S-X 2 
a = rf e 2 dA, 9.7.17 
-x 


where 
2M 
r = — = 
JS 
(Kimura and Crow, 1964). Thus, for given values of M(=N,u) and 
S(=N,5), the value of f may be obtained by first solving 9.7.17 for X and 


then obtaining f from 9.7.13. Then the effective number of alleles (n,) is 
obtained by 1/f. 


Unless S is small, 2./S — X in the integral of 9.7.17 may be replaced 
with good approximation by oo to give 


9.7.18 


r Pega 9.7.19 


In deriving the above relations, one of the essential assumptions we have 
made is that f in equation 9.7.8 is a constant which represents the expected 
value of the sum of squares of allelic frequencies, i.e., 


f-E (Y x4). 9.7.20 
i 
So, let us examine the corresponding value evaluated from the integral: 
1 1 ; 
J = | x?@(x) dx = cf xg 299 fV "4 Mx 9.7.21 
0 0 
Noting 9.7.12 and 9.7.16, it can be shown that 


M 2M 
J=f- z e 25172615). 9.7.22 
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Thus, if 
M 2M 
f?» = e728(1-2s +=) 9.7.23 
or equivalently, if 
u Q gfe S 17 27)*4M 9.7.24 
we have 
1 
Í x^d(x) dx = f 9.7.25 
0 


with sufficient approximation, and f in 9.7.8 indeed represents the expected 
value of 


ere 


Therefore, under this condition, the present formulation is satisfactory for 
estimating this expected value by solving for f using 9.7.17 or 9.7.19. 

In the above treatments we have studied the effective number of alleles, 
n, , defined by 


1 
Nne = F 9.7.26 
This is different from the actual number of alleles, the expectation of which 
may be obtained from 


1 
1-— 

n, = f i N(x) dx. 9.7.27 
ZN 


In the latter, the rarest allele is counted as equivalent to a more common type 
when the allele number is counted. Therefore, ne € n,. For a discussion of 
actual and effective number of alleles, see Section 7.1. 

The segregational load due to overdominant alleles is given by 


|^ 


L,-sf- 9.7.28 


3 


e 


In Table 9.7.1, the relation between r(— 2M IVS) and fJS is tabulated 
for X (considered as a parameter) between 3.0 and —3.0, using the relation 
9.7.19 and 


Kiat, a729 
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Table 9.7.1. Quantities needed for computing the 
effective number of alleles. 





2M Ne - 
s ( v 5) VS ivs 
3.0 0.00444 0.666 1.50 
2.7 0.0105 0.738 1.36 
2.5 0.0176 0.794 1.26 
2.2 0.0360 0.894 1.12 
2.0 0.0553 0.973 1.03 
1.7 0.0984 1.11 0.899 
1.5 0.139 1.22 0.819 
1.25 0.204 1.38 0.727 
1.0 0.288 1.55 0.644 
0.75 0.389 1.76 0.570 
0.5 0.509 1.98 0.505 
0.25 0.646 2.23 0.448 
0 0.798 2.51 0.399 
—425 0.964 2.80 0.357 
—0.5 1.14 3.12 0.321 
—0.75 1.33 3.46 0.290 
—1.0 1.53 3.81 0.263 
—1.25 1.73 4.18 0.239 
—1.5 1.94 4.56 0.219 
—1.7 2.11 4.88 0.205 
—2.0 2.37 5.36 0.187 
—2.2 2.55 5.69 0.176 
—2.5 2.82 6.19 0.161 
—2.7 3.00 6.60 0.151 
—3.0 3.28 7.11 0.141 


which is derived from 9.7.13. Here the numerical calculation is facilitated by 
the observation that r is equal to the ratio between the height of the normal 
curve (with mean O and unit variance) at X and the area under the curve 
from — X to +œ. In Figure 9.7.1, the relation between 2M/./S and nA fS 
(ie., the reciprocal of KS) is shown graphically. For example with 
N,= 10°, s= 107°, and w=1075, we have S = 100 and M = 1 so that 
r =2M/,/S = 02 and the graph gives n,/,/S œ 1.35 or n, = 13.5. 
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Figure 9.7.1. Graph showing the relationship between r — 2MIV. S and 
nV. S, where n, is the effective number of alleles. 


For values of r outside the range tabulated, the following approximations 
will be found useful: 
(1) For a small value of r, use 


l 0.4 
fJS =; Jis legi ( 5A. 9.7.30 


This is derived from the two approximate relations, 





1 x? 


doa ^ 





r= 


and 
X 
A'S-7. 


which hold when X is large and r is small. For example, with N, — 105, 
$5 — 0.1, and u = 0.5 x 1075, we have J/S = 100, 2M = 1, so that r = 0.01. 
Then formula 9.7.30 gives FS = 1.36. We get practically the same value at 
this level of approximation from interpolation in Table 9.7.1. The effective 
number of alleles in this case is approximately 73.6. 

(2) Fora large value of r, use 


INE = > (1 — a) 9.7.31 
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This is derived from 9.7.29 and the relation 


1 
r= |X| tu 4e, 


IXI |X? 


which holds for a large negative value of X, for which we have 


o0 a2 1 
f e z dì = — e 
IX| 


Pd 1 3 1x3x5 E 
|X| 


— — + — — ———— 
Ix? |x|? [x|° 


Already at r = 3, 9.7.31 gives L/S = 0.148, a quantity very near to 0.151 
which we get from Table 9.7.1. 
Formula 9.7.31 may also be expressed in the form 


l 
n, = 4M ( + 3) ; 9.7.32 
r 


This can include the limiting case of s = 0 and therefore r = oo (i.e., neutral 
mutation). In this case, we get 


n, =4M. 9.7.33 


Actually in this special case of neutral mutation, it was shown in the previous 
section that 


n, — AM +1. 9.7.34 


Since we are only concerned with the case of large n, (or small f), formula 
9.7.33 is sufficient for our purpose. Formula 9.7.32 is interesting in that it 
suggests that if r is large, or 2M is much larger than JS, selection plays only 
a minor role in maintaining a given number of alleles. In such a case, over- 
dominance increases the number of alleles only about 100/r? ?7. For example, 
in a population of N, — 10?, roughly 400 alleles may be maintained with the 
unusually high mutation rate of u = 107°, if the alleles are neutral. With the 
development of 1% overdominance (s = 1072), only 10 more alleles will 
be added. With overdominance of 10% (s 2 0.1), 70 more alleles are added 
(cf. Table 9.7.1). Even with such a high overdominance, s = 0.5, the number 
of alleles added is roughly 200. No overdominance but a 50% increase in 
mutation rate can do the same. Thus, overdominance is extremely inefficient 
as a factor maintaining a large number of alleles in a population. 

Figure 9.7.2 illustrates a Monte Carlo experiment performed to test 
the theoretical treatment of the number of overdominant alleles. The experi- 
ment was performed by using the IBM 7090 computer to simulate a popula- 
tion consisting of 50 breeding individuals in which one mutation occurred 
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in each generation (u = 0.01, N, = N = 50). The selection coefficient against 
homozygotes was s = 0.] and the population contained 100 different alleles 
at the start (Oth generation). 

In the figure, experimental outcome on the effective number of alleles 
is shown from generation 60 through generation 400. Averaged over 18 
outputs, the observed n, is about 4.7, while the theoretical value for n. 
(horizontal line in the figure) which is obtained from 9.7.17 is n, — 4.19. 
So the two values for n, agree fairly well. 
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Generation 


Figure 9.7.2. Monte Carlo experiment on the number of overdominant 
alleles, s = 0.1, N= N. = 50, u = 0.01. 


One of the most interesting applications of formula 9.7.17 is reported 
by Kerr (1967), who studied the number of sex-determining alleles in bee 
populations in Brazil. These alleles are haplo-viable but homozygous lethal 
so that among females they are overdominant with s = 1. In the population 
of Piracicaba, having an independently estimated effective population number 
(N,) of 428, the observed number of alleles is 11.0 ~ 12.4, while the cor- 
responding theoretical value derived from the formula of Kimura and Crow 
(1964) is 11.2, if we take the mutation rate u = I0 ?, and it is 10.9 if u = 1076. 
So the number of alleles actually observed and the corresponding theoretical 
value are in good agreement. For the details, see the original paper by Kerr. 
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9.8 The Number of Heterozygous Nucleotide Sites 
Per Individual Maintained in a Finite Population 
by Mutation 


In the previous section, we investigated the number of alleles maintained 
at a single locus, assuming that the number of possible allelic states is so 
large that each new mutant is a state not preexisting in the population. 

In the present section, we will consider a different model and will 
investigate the number of heterozygous nucleotide sites per individual. We 
will assume that the total number of nucleotide sites per individual is so large, 
while the mutation rate per site is so low, that whenever a mutant appears 
in a population it represents a mutation at a site in which a previous mutant 
is not segregating. Such a model is also appropriate if we regard as a site a 
small group of nucleotides such as a codon rather than a single nucleotide. 
To treat this model, we will use the Kolomogorov backward equation as 
explained in Chapter 8, Section 3. 

Let us assume that on the average in each generation mutant forms 
appear in the population in v, sites. Consider a particular site in which a 
mutant has appeared. Let $(p,x;t) be the probability density that the 
frequency of the mutant in the population becomes x after t generations, 
given that it is p at the start (1t = 0). Then ¢ satisfies the Kolmogorov back- 
ward equation (see equation 8.3.2), 


O$(p,x;t) |., 0'ó(p,xi!) 
Er xEG 1 ape 


OQ p, x5t 
M, 9G x30 9.8.1 


Op 
In the above equation, Msp and V,, stand for the mean and the variance of 
the change of the mutant frequency per generation. For example, if the 
mutant in each locus has selective advantage s in homozygotes and sh in 
heterozygotes over the preexisting form, we have 


Msp = sp(1 — p)i^ +(1 — 2h)p}, 9.8.2 


and, if random fluctuation in mutant frequency is solely due to random 
sampling of gametes, 


_ pl - p) 
Po 2N, ! 


e 


V; 9.8.3 
where N, is the variance effective number of the population. 

In the following treatment, we will assume that Ms, and V;, are the 
same for all the sites. Since a mutant form which appears in a finite population 
is either lost from the population or fixed in it within a finite length of time, 
under continued production of new mutations over many generations, there 
will be a stable distribution of mutant frequencies in different segregating 


DISTRIBUTION OF GENE FREQUENCIES IN POPULATIONS 467 


sites. We will denote such a stable distribution by ®(p, x), which is expressed 
in terms of ó(p, x; t) as 


Q(p, x) = v, f 0. x5 I) dt, (0 « x « 1). 9.8.4 


This distribution is interpreted to mean that ®(p, x) dx stands for the expected 
number of sites having mutants in the frequency range x to x + dx. This may 
be seen as follows. Since new mutant forms appear in each generation in Vm 
sites, Ym Ọ(P, x; t) dx represents the contribution made by the mutants which 
appeared t generations earlier with initial frequency p to the present fre- 
quency class x — x 4- dx. Thus, adding up all the contributions made by the 
mutations in the past, the expected number of sites having mutants in the 
frequency range x ~ x + dx at the present generation is 


[vn f c. x; 1) ai dx. 


Let us now consider the number of heterozygous sites-per individual. 
Assuming random mating, the proportion of heterozygotes in a site having 
the mutant with frequency x is 2x(1 — x). So, the expected number of hetero- 
zygous sites per individual is 


H(p) = Ji 2x(1 — x)®(p, x) dx 
9.8.5 


=v, f 2x0 — x) dx f. 0. x; t) dt. 


In order to obtain an equation for H(p), we multiply each term of the back- 
ward equation 9.8.1 by v,,2x(1 — x) and then integrate each of the resulting 
terms first with respect to x over the open interval (0, 1) and then with respect 
to t from t = 0 to t = oo. This leads to 


E faxa $ sO) d 5 V $^ H(p) + My, = H(p) 
J, s s f 2 X)O(P, x; eG es eee gg P 


9.8.6 
The left-hand side of this equation becomes 
1 [| 
zi 2x(1 — x)ó(p, x; 00) dx — Í 2x(1 — x)$(p, x; 0) ax}, 
0 o 
which is further reduced to —2v, p(1 — p) if we note that 
$(p, x; 00) =0 98.7 


for 0 <x <1, and 


$(p, x; 0) = d(x — p), 9.8.8 
where 6(:) is the Dirac delta function. 
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Of the above two conditions, the first, i.e., 9.8.7, follows from the fact 
that the mutant form is either fixed or lost in a finite time, while the second, 
i.e., 9.8.8, is based on the assumption that the initial frequency of the mutant 
is p. Thus, we obtain the ordinary differential equation for H(p), 

4V;,H"(p) + Mapt’ (p) + 2v, p(1 — p) = 0. 9.8.9 
Furthermore, since * mutations" at p = 0 and p =! do not contribute to 
the segregating sites, we have the boundary conditions 


H(0) = H(1) = 0. 9.8.10 


The solution of 9.8.9 which satisfies the boundary conditions 9.8.10 is 


HG) = ( — up) [Cu dE + wf WOU- u@)} dé, sam 


where 
i 4 G(x) dx 
Wal) = 4v, (1 — C) Yao 9.8.12 
and 
[1669 ax 
u(p) = = 9.8.13 
I, G(x) dx 


(see 8.8.3.8) is the probability of ultimate fixation in which 
x(M 
TOES B uri 
In the special case of no dominance in which the mutant has selective 


advantage s in homozygotes and s/2 in heterozygotes over the preexisting 
form, 9.8.2 becomes 


S 
Ms, = 5 pl — p) 


and, if we combine this with 9.8.3, i.e., Vs, = p(l — p/(2N,), we have 
G(x) 2e 9*,  u(p) =(1 — e "(1 — e775), and  yg(£) = AN, v, e288 
x(1 — e 7?5)/S, so that we obtain 


4N, v, [1 — e 75? 
^) 


S l—e 2s 


H(p) = 





9.8.14 


where S = N,s. 
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In a population consisting of N individuals, if the mutant in each site 
is represented only once at the moment of its occurrence, p = 1/(2N) in the 
above formulae. Thus, for the case of no dominance, we have 


s 
4NeYm{ 1— eÑ 1 
H(1/2N) mem peo 9.8.15 


This reduces to 
N 
H(1/2N) = 4y,, N 9.8.16 


if the mutant form is advantageous such that 2S = 2N,s » 1. On the other 
hand, if the mutant is deleterious such that 2N,s' > 1, in which s' = — 5s, 
9.8.15 reduces to 


2v 
H(1/2N) z —. 8. 
(1/2N) Ng’ 9.8.17 
Finally, if the mutant is almost neutral such that |2N,s5| < 1, we have 
N 
H(1/2N) = Va 9.8.18 


These results suggest that mutant genes with definite advantage or disadvan- 
tage cannot contribute greatly to the heterozygosity of an individual because 
advantageous mutations occur rarely (as shown by small y,, in 9.8.16) and 
deleterious genes are eliminated rapidly (as shown by large Ns’ in 9.8.17). 


9.9 Decrease of Genetic Correlation with Distance 
in the Stepping-stone Model of Population Structure 


Usually, a total population forming a species is not a random-mating unit, 
because the distance of individual migration is usually much smaller than 
the entire distribution range of the species. This phenomenon, which Wright 
called ** isolation by distance " (Wright, 1943), will lead to local differentiation 
of gene frequencies due to random genetic drift. Wright considered a model 
in which a population is distributed uniformly over a large territory, but 
the parents of any given individual are drawn at random from a small 
surrounding region. The size of the neighborhood, that is, the population 
number in such a surrounding region, plays a fundamental role in his analysis. 
He studied, by his method of path coefficients, the pattern of change in the 
inbreeding coefficient of subgroups relative to a larger population in which 
they are contained (Wright, 1940, 1943, 1946, 1951). He has shown that the 
tendency toward local differentiation is much stronger in a linear than in a 
two-dimensional habitat. 
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The problem of local differentiation may also be studied in terms of 
change in correlation with distance, as considered by Malécot (1955), since 
individuals living nearby tend to be more alike than those living far apart. 

In natural populations, individuals often are distributed more or less 
discontinuously to form numerous colonies, and individuals are exchanged 
mainly between adjacent or nearby colonies. To analyze such a situation, 
Kimura (1953) proposed a model of population structure which he termed 
the “‘stepping-stone model." A successful mathematical treatment of the 
model for the case of an infinite number of colonies was presented by Kimura 
and Weiss (1964) up to three-dimensional cases. A more general treatment 
that can cover any number of dimensions was also presented by Weiss and 
Kimura (1965). Earlier, Malécot (1959) studied a similar model and obtained 
the asymptotic solutions for one- and two-dimensional cases. 

In the present section, we will consider mainly a one-dimensional 
stepping-stone model with an infinite array of colonies (see Figure 9.9.1). 


m, mi, 
2 2 
— = i; 


1—1 I 1+1 


Figure 9.9.1. One-dimensional stepping-stone model. 


Let us consider a pair of alleles 4, and 4, and let p; be the frequency 
of A, in the ith colony in the present generation before migration. We assume 
that to go to the next generation, long-range migration, short-range migration, 
and random sampling of gametes occur in this order. 

In long-range migration, we assume that each colony exchanges individ- 
uals with a random sample taken from the general gene pool with constant 
gene frequency p, at the rate Me per generation. The long-range migration 
is formally equivalent to mutation or, more generally, any linear systematic 
evolutionary pressure. For example, if there is mutation between A, and A, 
in addition to long-range migration, M should be replaced by u +v + m, 
and imi, p, by v + m, p; in the following treatment, where u is the mutation 
rate from A, to A, and v is the mutation rate in the reverse direction. Thus, 
by long-range migration, the gene frequency changes from p; to 


(1 — m,)p, + m, pr. 9.9.1 


Next, we assume that short-range migration takes place. If such a migration 
is restricted to two neighboring colonies, each one step apart, and if m, is 
the rate of such short-range migration per generation so that m,/2 is the 
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proportion of individuals exchanged each generation between a pair of 
adjacent colonies (Figure 9.9.1), then the gene frequency becomes 


P; (Y — m,((1 — m,)p; + ms pj) + = ((1— mj)pi-, + Ps pi) 


m E 2 
+ F {((1 — mM.) Pi+1 + Mo pi). 


or 


= m (1 m Mo) = 
Pj — (1 — m,X1 — Hp, — —— (Dii + Pi+1) + p, 9.9.2 


where P; denotes the frequency of A, in the ith colony after long-range and 
short-range migrations. 

Finally, we assume that random sampling of gametes takes place to form 
the next generation. Thus, if we denote by č; the amount of change in the 
gene frequency resulting from random sampling of gametes in the ith colony, 
the frequency of A, in the next generation is 


Pi =P; + či. 9.9.3 
Since £, represents the change due to random sampling, we have 


Es(6;) =9 9.9.4 
and 


E45) = E 9.9.5 
where N, is the effective population number of each colony and E, designates 
the operator for taking the expectation with respect to random sampling. In 
the following calculation we will denote by E, an operator for taking the 
expectation with respect to the existing gene frequency distribution, so that 
the operator £ for taking the overall expectation is given by E = E, E;. 

Let us first consider the mean p of the gene frequency distribution at 


equilibrium, i.e., E(p;) = p. Taking expectations of both sides of 9.9.3 and 
noting 9.9.2 and 9.9.4, we obtain 


E(pi) = (1 — m4)p + m, pr. 
Since E(p;) = p at equilibrium, we have 


p = Pi. 
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So, we will substitute p for p; in the following treatment. In order to calculate 
the variance and the correlation coefficients of gene frequencies at equilibrium, 
we will let 


Pi—cpi-—-P 
and 

P,—P,;— p. 
Then 9.9.2 becomes 

P; = af, + Pli- + Biss); 9.9.6 
where a = (I — m,Y(1 — m4) and f = m,(1 — m,,)/2. Also, 9.9.3 becomes 

Bi = P;+ &;, 9.9.7 
where p; = p; — P. 

We will denote by V, the variance of the gene frequency distribution 


among colonies. Squaring both sides of 9.9.7 and taking expectations, the 
variance in the next generation is 


Vi, = E(B?) = E,(P? + 2P, EE) + Es(€?)}. 


If we apply 9.9.4 and 9.9.5 to the right-hand side of the above equation and 
note that P; = P; + p, we obtain 


l 52, , P — p) 
Vio = Be) a 9. 
á (1 x) «(Pi) + 2N, 9.9.8 
Now, from 9.9.6 
E(P?) = a? V, + AafV, r, + 2B? V3 + ra), 9.9.9 


where r; is the correlation coefficient of gene frequencies between two colonies 
which are j steps apart, i.e., 
E E 4B; Bi+;) 
j m E a 
V, 


In particular, ry = 1, and also we assume r_, — r,. Substituting 9.9.9 for 
E,(P?) in the right-hand side of 9.9.8, we have 


; 1 B(1 — p) 
v = ( = xx) {a + 4xfr, + 28*(1 + r2)}V, + 2N, ` 9.9.10 
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Thus at equilibrium in which V; = V,, we obtain 
1 p — p) 


PON: 1 
*l1- (: -— Je + 2p? + 4afir, T 2f?r,) : 


9.9.11 


where r, and r, are correlation coefficients of gene frequencies between 
colonies one and two steps apart. 

In order to obtain the correlation coefficients r;(j = 1, 2, ...), let us con- 
sider the covariance (C) in the next generation, 


C; = E(B; bi. ;), 


in which p; = P, + č: and pi, ;— Pii; + 6i. ;. Since for j 21, č; and £i,; 
are mutually independent random variables, each with mean 0, we have 


C; = E((P; + EMP: .; + Ši) 
or 
Ci E ED Pa): 
Then, if we use relation 9.9.6, we get 
C; = E,([ap, + B(Pi-1 + Bis MOB + 5 + Bis ;-i + Bis 54+) 
= a?C; + 2af(C;., + Cj-1) + B (Cj.; + 2C; + Cy-2). 
At equilibrium in which C; = C,, noting r; = C,/V,, we obtain 
(a? + 2B? — Yr; + 2af(rjsi t+ rj-) + B (rjairj-320, (Z1) 
9.9.12 


Equation 9.9.12 holds for j = 1. However, for j = 1, r_, should be replaced 
by r, to give 
(a? + 2]? — Dr, + 2af(r; + 1) + B?(r4 + 71) = 0. 9.9.13 
The essential part of the mathematical treatment of the stepping-stone 
model is to find the solution (r;) for the system of equations 9.9.12 that 
satisfies the boundary conditions, rg = 1 and rẹ =0. This was done by 
Kimura and Weiss (1964) and also by Weiss and Kimura (1965). Here we will 


present an elementary treatment which was given in Kimura and Weiss 
(1964). Let r; = 4? and substitute in 9.9.12; then we have 


(a? + 2]? — 1) + 2«f(A + A7!) + B2(A? +. A172) 200, 
Or 


{a + BA 4-47)? —1=0, 
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from which we obtain the four roots, 


_ (=a) V = 4)? = (287 





Ay 25 
1 "Q7 9- Va = 0)? - Qr 
2^ — Oo 3 
9.9.14 
(1 +a) + /(1 + a)? — (28) 
Ay m E 
2p 
pe Ut) - J/ * ay = (287 
477 2p = 
Then the required solution may be expressed in the form 
4 H 
r,= Y KiAi, 9.9.15 


i=l 


where K;'s are constants. However, since 2, > 1, | >A, > 0, 2, < — 1, and 
—1 <A, < 0, we must have K, = K, — in order that r, vanishes at j = œ. 
Furthermore, in order that ro = 1, we must have K, + K, = 1. Thus, writing 
K for K5, 9.9.15 becomes 


r; 2 KA t (1— K)À. 9.9.16 


In order to determine K, we substitute r; = KA, + (1 — K)44, r = KA} 
+ (1 — KJ, and r, = KA} + (1 — K)A} in 9.9.13. This yields 


2aß + K{(a? + 3p? — 1)2; + 2uBAZ + B?43) 
+ (1 — K){(a? + 38? — 12, + 2242 + 223) — 0. 


Then, if we use the relationships 


9.9.17 











: 1 — 1 — ay, 1 — 
B= z TL pal 2 ii - z 











a B 
and 
14a (1 + «Y l+a 
j= - B À, — 1, a-| p? -i)a + B , 


9.9.17 is reduced to 


2af + KOB?1; — Bla + 1)} + (1 — K){287A, + BCL — 09) — 0. 
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This is further reduced, if we substitute 9.9.14 for A, and A,, to give 


R, — (Ri + Rj)K «0, 


Or 
R, 
K = ———, .9. 
Rik 9.9.18 
where 
R, =./(1 + a)? — (28)? 
and 





R, = /(1 — ay? — (2), 


in which a = (1 — m,)(1 — m,) and f = m,(1 — m,,)/2. 

Therefore, the correlation of gene frequencies between two colonies 
which are j steps apart is given by 9.9.16 with A, and A, given by 9.9.14 and K 
given by 9.9.18. Then, applying this formula to calculate r; and r, in 9.9.11, 
the formula for variance becomes 


"7M PU- p) 
d 1 UT 
2N,(1—1|1- | - ——— 
| ( x) R, +R, | 


In the special case of no migration between adjacent colonies (island 
model), m, = 0 and therefore R, =1+ «, R, = 1 — «a. Thus, 9.9.19 agrees 
with 9.2.13 except that m,, rather than m is used to represent long-range 
migration. 

Weiss and Kimura (1965) developed a more sophisticated method which 
can also treat higher dimensional cases. Using this method, the correlation 
in the one-dimensional model is expressed as 


9.9.19 





1 [- cos j0 d0 
27 Jo 1 — H%(cos 0) 

n=- n T (j20), 9.9.20 
2n J. 1 — HXcos 0) 


where 


H(cos 0) = a + 2 cos 0, 9.9.21 
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in which a2(1—rm,(1—m4)21—m,-—m,(1—-m,) and 2pm, 
x (1 — Ma). Formula 9.9.20 may also be expressed in the form 


= A,(j) + A2()) gas 
^. A,(0) + A;(0)’ H 
where 
1 pr?" cos j0 d0 cos j0 d0 
P A) me Meis REDE ME MM RAM M Md 9. 
iQ) 4n i, 1— H(cos0) 42 J. (1 — a) — 2p cos 8 "ues 
and 
ve cos jð dð . | p” cos j0 d0 NT 
ALU x 1 + H(cos0) áno (1 +a) + 2f cos 0" z 


To evaluate the above integrals the following formula will be found useful: 





9.9.25 


1 
—— fy2 _ mM n 
1 [?"cosn0d0 — D : Inn d 
x andi ^ ]y*! f$ 
Fur x ed x«-1 


Then, 


Ai) 7 = i= Ja? — 1) 9.9.26 





and 


A) = 3I = LG use 1 — ay, 9.9.27 


where 
1—2a ji: m, 
SEPT COR m,(1 — Fi,) 
and 
l+a 2-m, 
a, = — = — 


Thus 9.9.20 agrees with 9.9.16. 
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Though the above solution for r; given in 9.9.16 is perfectly general, a 
simple approximation formula is available for an interesting and important 
special case in which m, < m,. In this case, A,(/) is small in comparison 
to A,(j) so that 


(J) 


approximately. If in addition m, is small so that 1 > m, >m,,, we have, 
from 9.9.19. 


r;=e 9.9.28 


p(l— P) 


= BA 9.9.29 
1 4 AN /2m,fm,, 


P 


approximately. 
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Figure 9.9.2. Decrease of genetic correlation with 
distance when m, = 0.1 and mi = 4 x 10-5. (From 
Kimura and Weiss, 1964.) 
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In Section 9.2 we derived formula 9.2.5, giving the gene frequency 
distribution among subgroups in the “island model." In this model, if the 
entire species is subdivided into colonies, each with effective size N,, and 
each colony exchanges individuals at the rate m with a random sample 
taken from the entire species, the frequency distribution is given by 


$(p) = Cp* "m^ !(1 — pyNema-mzt, 9.9.30 


where C is a constant. 

In the stepping-stone model which we are considering in this section, 
there is a correlation in gene frequency between immigrants and the receiving 
colony so that m,(1 — r,) gives the effective rate of exchange comparable to 
m in the island model. So, the gene frequency distribution among colonies in 
the stepping-stone model may be approximated by 9.9.30 if we substitute 


m =M +m (i-r). 9.9.31 


When m, <m,, we have mx J2mm., since r; z 1 —./2m,,/m, from 
9.9.28. So, from 9.2.7, the variance of gene frequency among colonies is 


, BU-  Pü-p 
P” 4N,m tl AN, /2m,m,, 41 


which is in good agreement with 9.9.29. 

The mathematical treatment of the two- and three-dimensional models is 
more difficult, but it is given in Kimura and Weiss (1964) and in more detail 
in Weiss and Kimura (1965). Figure 9.9.2 illustrates the decrease of genetic 
correlation with distance for one-, two-, and three-dimensional cases, assum- 
ing m, —4 x 1075 and m, = 0.1. The figure shows that it depends very 
much on the number of dimensions. 
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SOME STATISTICAL 

AND MATHEMATICAL 
METHODS 

FREQUENTLY USED 

IN POPULATION GENETICS 


reader may not have encountered or has forgotten. There is no attempt 

at mathematical rigor, although we have tried to supply enough of the 
background to make clear the general nature and limitations of the methods. 
This is written for biologists, not mathematicians, and if one of the latter 
chances to be reading this he is invited to look the other way or at least be 
tolerant of some of the intuitive arguments and cookbook attitudes. 

No knowledge of statistics, matrices, or higher mathematics is assumed, 
but the reader is expected to know the elementary theory of probability and 
to be familiar with the differential and integral calculus. 

This appendix is intended to be sufficient for any procedures used in the 
first six chapters, and most of Chapter 7. However, the last two chapters 
involve subjects of considerably greater mathematical difficulty and it seems 
to be impractical to include all the procedures here. The reader will either 
have to accept some results without derivation or look up the methods else- 
where. We have tried to provide references. 


l f Vhe purpose of this appendix is to supply some of the methods that the 
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Among the various books that might be mentioned as sources of 
additional information we call particular attention to two that are by pioneers 
in the field of population genetics. One is the first volume of Sewall Wright's 
Evolution and the Genetics of Populations, which is devoted to the biometrical 
and statistical foundations of population genetics. The other is R. A. Fisher's 
Statistical Methods for Research Workers. 

There is one method of great importance in population and biometrical 
genetics which we have omitted. This is Sewall Wright's method of path 
coefficients. Although we have used alternative methods in this book, the 
reader should realize that many of the results were first obtained by this 
method. The procedures are derived and explained in Wright's first volume 
(1968). A clear elementary exposition is given by Li (1955a). 


A.1 Various Kinds of Averages 


If there are a large number of observations or measurements it is usually 
difficult, if not impossible, to make much sense of them by examination of 
the individual values. We therefore make use of various derived quantities 
that extract from the data the information in which we are especially inter- 
ested. We usually desire some measure of the central or typical value and 
some measure of the amount of variability. We might wish, in addition, to 
know other things, such as whether the values are symmetrically distributed 
about the central value. Usually, we are especially interested in two quantities, 
the mean and the variance. 

The ordinary average, or arithmetic mean, is represented by My or X 
and is defined by 


l i x 
yt XE Xo + XOT LX, A.1.1 


My = X = 
where X,, X5,..., Xy are the successive measurements, N is the number 
of measurements, and the Greek Y^ means to add all the X's from X, to Xy. 
The convention of indicating the mean by a superior bar is widely used. 

The geometric mean is the Nth root of the product of the N values. It 
is frequently useful for data that are not symmetrically distributed. In 
population genetics the geometric mean is useful in measuring average 
population growth over a period of generations, because of the geometric 
nature of population increase. The geometric mean is defined symbolically by 


1 N OM 
Gx = (XX2 XN = (11 x.) A.1.2 
1 
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For computation, it is more conveniently defined as the antilog of 

za TH 

xc N > Xi, A.1.3 
where x; = log X;. In other words, the geometric mean is the antilog of the 
arithmetic mean of the logarithms of the values. 

Another mean that is used in population genetics, for example in the 


study of effects of chance in small populations, is the harmonic mean. The 
harmonic mean is the reciprocal of the mean of the reciprocals, or 


Lobe (th . 
H, N X, : 1.4 


Table A.1. Some hypothetical data to illustrate computational methods. 








i X; X, logio X, XX (X, — Xy 
i 

l 12 .083 1.079 4 
2 9 .111 .954 =i 1 
3 13 077 1.114 9 
4 7 143 845 23 9 
5 11 .091 1.041 1 1 
6 11 .091 1.041 1 1 
7 9 J 954 zd 1 
8 ll .091 1.041 l ] 
9 7 .143 .845 =3 9 
Sum 90 941 8.914 0 36 

N=9 

. 90 
Mx=X=7=10 
Hx = 9.56 
5 EGAN 7 
9 
Gx —antilog —— = antilog . 990 = 9.77 
Median =11 
36 


Kies d 
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For some purposes, the most appropriate summarizing value is not any 
of the mean values, but the median. The median is the middle value; that is, 
the value chosen so that there are an equal number of observationsabove and 
below it. If the total number of observations is even, the median is defined 
as the mean of the two central values. A familiar example of the use of the 
median is in the halflife of a radioactive element. An exactly analogous prob- 
lem arises in measuring the number of generations through with a harmful 
mutant gene persists in the population; it is sometimes convenient to measure 
its halflife, or median persistence, rather than its mean persistence. 

These various averages are illustrated with a simple numerical example 
in Table A.1. 


A.2 Measures of Variability: 
The Variance and Standard Deviation 


There are also several measures of variability, but we shall consider only two, 
the variance and the standard deviation. The variance, V, usually reflects 


the properties of greatest genetic interest. It is defined as the mean of the 
squares of the deviations of the individual items from their mean. 
Symbolically, 


Vx -iF [(X; - X)?]. A.2.1 


This is illustrated with a numerical example in Table A.1. 
For computation, it is more convenient to write the variance formula as 


Vy = X? — X?; A.2.2 


that is, the mean of the squared value minus the square of the mean value. 
This can be derived as follows. From A.2.1, 


1 x - 
Vy = Ne (X? — 2X X; + X?) 
5 5 [} X? 2X} X;4 NX'] (since E X? = NX?) 
= 5 [Z x} - xj (since Y. X; 2 NX) 
=X- Xx. 


Data are often grouped into classes with the same or similar measure- 
ments. In this case the mean and variance are computed by weighting each 
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measurement by the number of individuals with that measurement. If there 
are n, individuals with measurement X,, n, with measurement X, , and soon, 
then 


n,X, +n Xa+ +X, Yn; X; 


Y= E 2. 
n,tnitcoccAn Yn Pn 
and 
(X; — Xy ;X} — NX? 
y = RMT AE A.2.4 
where N =} n,. 


Notice that, as before, the variance formula can be written 
Vx = xX? = X^. 


where the superior bar now indicates the weighted mean. 

In population genetics the frequencies of individuals with each measure- 
ment or attribute are usually expressed as a proportion of the total. 

If p; =n,/N, then »' p, = 1, 


X -YbiXi, A.2.5 
and 
Vy =) p(X; — XY =} pi Xi — X’. A.2.6 


The variance is always measured in units that are the square of the units 
of the original measurements. This is sometimes inconvenient, as when the 
variance in height of a group of persons is given as square inches, or variance 
in the yield of wheat is in square bushels. In order to return to the original 
dimensions, it is customary to take the square root of the variance, and call 
the resulting quantity the standard deviation. 

Thus, the standard deviation, c, is defined by 


o= JV A.2.7 


The population variance is often denoted by o?. 

Despite these dimensional difficulties, the variance is almost always 
the more useful quantity in population genetics. There are two principal 
reasons. One is that the variance has properties of additivity and sub- 
divisibility whereas the standard deviation does not. This means that if a 
compound measure is the sum of two independent measures, the variance of 
the compound is the sum of the variances of the two parts. For example, if 
we can think of the yield of maize, Y, as being the sum of a genetic compon- 
ent G, and an independent environmental component, E, then Vy = Vg + Vg. 
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The second property of the variance that makes it especially useful in 
population genetics is that the rate of evolutionary change is more closely 
related to the variance than to other measures of population variability. 

The variance of the sum of two measurements may be derived as follows. 
Let X and Y bethe two measurements. 


1 C 
Vy (Kit Yi- X + Y)? 
= E -X+Y,- Y) (since X + Y 2 X +Y) 


= [EG - 3! + X; - Y «2X 05 - X, - Y] 


= Vy + Vy + 2 covyy, A.2.8 


where 
1 z^ Z 
covxr = ọ Y (X; — XY, — Y). A.2.9 


Covyy is called the covariance of X and Y. 


If the measurements are of several quantities, then the variance of the 
sum is 


Vex—), Vx - 2Y covxy, A.2.10 


where the last sum is the sum of all possible pairs of covariances. If there are 
measurements on n objects then there are n(n — 1)/2 covariances and n 
variances. 

For example, if we had measurements on hand length, forearm length, 
and upper arm length for each of N persons, the variance of the total arm 
length would be given by the sum of the three variances and six covariances. 

If the quantities X and Y are independent of each other they will tend 
to deviate from their means in opposite directions just as often as in the same 
direction. In the former case, the value of the product will be negative, in 
the latter, positive; so, on the average, Y (X; — XXY, — Y) will be 0. 

Therefore, if the quantities X and Y are independent 


Vyx.y Vy t Vy, A.2.11 
or, if there are more than two quantities, and they are independent, 


Vyx—. Vy. A.2.12 
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If there are n equally variable and independent quantities, then the 
variance of their sum is 


Vyx =nVy. A.2.13 


If K is a constant 


1 — 
Vex = N > (KX; — Kx)’ 


e x XIKO;— X)P (since KX = KX) A.2.14 


=K E(X- Xy 


= K’ Vy. 


From the foregoing, we derive a very important formula—the variance 
of a mean. If X is the mean of N independent observations, using A.2.12 
we obtain 


Vx 


1? v. o 
`| y. i =— : 
z) Pu (since X N $ x, 
A.2.15 


1 
N Vx (from A.2.13). 


A.3 Population Values and Sample Values 


In the practical use of statistics we are often interested in drawing inferences 
about some population on the basis of a sample of observations from the 
population. In such circumstances we have little interest in the sample values 
themselves except in so far as they provide information about the population. 
If the sample is representative of the population, we can use it as the basis 
for estimates of the unknown population values. 

If we wish to estimate the mean of a population, the mean of the sample 
is on the average a correct estimate. It is unbiased, in the sense that the 
expected value of X is u, the population value. 

On the other hand, the variance of a sample is a biased estimate of the 
variance of the population. However, there is a simple correction that removes 
the bias; simply divide by N — 1 instead of N. 


486 APPENDIX 


We estimate c?, the population variance, by measuring the squared 
deviations from the true mean, u, rather than the sample mean X. Using 
E to stand for the expected or mean value, we have 


V= E EGG - | 


ES LG - 30 € GC- wT) 


el [Spex 42% =p Y Oss T= w]. 


But, Y (X; — X) = 0 and E{(1/N) È (X — uy!) is the variance of the mean 
of X, which is V/N. Hence 


V= Els EO- xy + 5 V = HK - X)"} 


and an unbiased estimate of the population variance is 


EQ- 


V 
N-1 


A.3.1 
Most of the time in this book we are dealing with theoretical populations 
and their variances, so formulae A.2.1 and A.2.2 are appropriate; but when- 
ever one is dealing with actual data and wants to estimate the population 
values from these data, A.3.] should be used. 
The same correction applies to the covariance if the purpose is to 
estimate the population parameter from sample measurements. Thus, 


Y Qi - XXY,- Y) 
N-I f 


COVxy = A.3.2 
The variance of the mean, as estimated from measurements on a sample 
of N observations, is 
y2 
XQGG- X) 


yy -—————. «3. 
y N(N — 1) A.3.3 


A.4 Correlation and Regression 


The covariance is measured in units which are the squares of the original 
measurement units. So is the variance. It is frequently desirable to have a 
measurement of association that is dimensionless. For this purpose the 
conventional measurement is the Coefficient of Correlation. It is the covar- 
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iance divided by the geometric mean of the two variances. Thus the corre- 
lation coefficient, r, is 


COVyy Ia X-XY-Y 














pee = A.4.1 

= J Vx Vy N Ox Oy 

For computation, the formula is conveniently written as 

YQ - XY - Y) 
'xy EE e A.4.2 
J/X.a - xy YY - vy 
Xy «XY 

A 4.3 


V(x? - XXY? — v?) 

The value of the correlation coefficient ranges from —1 to +1. If there 
is no association between the variables (if X and Y are independent) the 
value is 0. With perfect linear association of X and Y, that is to say, if every 
increase in X is associated with a proportional increase in Y, the correlation 
is 1. Perfect negative association gives a correlation of —1. 

The coefficient of correlation is closely related to the regression coefficient. 
The traditional way to express a dependent variable Y in terms of an in- 
dependent variable X is to determine the curve of relationship by the method 
of least squares. By this procedure the curve is chosen so that the sum of the 
squares of the vertical deviations of the actual points from the curve is 
minimized. If the relationship is assumed to be linear, the regression co- 
efficient is the slope of the line. 

Suppose that there are a series of points, (X;, Y;), to which a straight line 
is to be fitted, as shown in Figure A.4.1. We wish to find the line for which the 
sum of the squared deviations, )’ D?, is a minimum. Let the equation of 
the line be 


Y' -f(X) =a + bX, 


where a and b are to be determined. For any value X, the corresponding 
value on the line is Y' = a + bX, so we minimize the quantity 


Q-YyD-Y(Y-YY-zY(Y-a-bxy. A.4.4 


To do this, we follow the usual procedure and differentiate Q with respect 
to a and to 6 and set the resulting quantities equal to 0. This gives 


20 
Ca 
oQ 


g= -2X XQ - a - bX) - 0, 


= -2Y (Y - a - bX) = 0, 
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X X1 X3 Xa Xs 
x 


Figure A.4.1. The procedure for fitting a straight 
line by the least-squares method. X is the 
independent variable and Y the dependent. The 
line is determined so that the sum of the squares 
of the deviations from the line, X( Y, — Y%)?, is 
minimized. 


from which we have the two equations, 
aN*tby X-YY-0, 
ay X+byY X? -F XY =0, 


where N is the number of pairs of measurements and Y' a = Na. The solutions 
are 


a-Y-bX, A 4.5 
where 
N 
and 
ee Y Y 
Y = =, 
N 
and 


= yx? — NX? = V, 2 A.4.6 
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The line of best fit (by the least-squares criterion) is 


Y -a-c-bX 
=Y -b(X — X), A.4.7 


where b (usually written as byy) is given by A.4.6. The slope, byx, is called 
the regression of Y on X. lt gives the amount by which Y changes for a unit 
change in X. Notice that if the two variables are measured as deviations from 
their means, say y — Y' — Y and x — X — X, the equation takes the simple 
form 


y = bx. A.4.7a 


This also shows that the regression line passes through the two means, X 
and Y. 

Another way of writing the regression, one that shows its relation to 
the correlation coefficient, is 

je Cy 

Pac A.4.8 

The regression coefficient is concrete and is expressed in the appropriate 
units of measurement for the dependent and independent variables. For 
example, if X is the pounds of fertilizer applied per acre and Y is the yield 
of corn in bushels per acre, by, is expressed as bushels/pound. On the other 
hand, the correlation coefficient is dimensionless. 

The correlation coefficient has two properties that are especially useful in 
interpreting genetic data. By substituting A.4.5 and A.4.8 into A.4.4, we see 
(after some algebraic rearrangement) that 


yq-vy-ü-nDytq - yvy. A493 


This tells us that r? is the fraction by which the squared deviations from the 
regression line are less than the squared deviations from the general mean, Y. 
In this sense, a fraction r? of the variance of Yis associated with (or explained 
by) a linear change in X ; the remainder, | — r?, represents the fraction of the 
variance due to random deviations from the linear association with X. In 
Chapter 4 we use r?, where r is the correlation between genotypic and pheno- 
typic measurements, as a measurement of heritability, that is to say, the 
extent to which phenotypic variance is accounted for by genotypic differences. 

The other interpretation is this: If the quantitative trait or measurement 
can be regarded as the sum of a large number of components of which a 
fraction, r, are common to the measurements while the remaining fraction, 
| — r, are independent, then the expected correlation between such measure- 
ments is r. For example, a mother and daughter are identical for half their 
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genes, the other half being independent (assuming that the father and mother 
are unrelated). Thus the expected correlation between mother and child for 
a trait determined by a large number of additively acting, independent genes 
would be 1/2. 

The correlation coefficient is useful in another, somewhat related way. 
Suppose that a quantitative trait or measurement, y;;, of the jth member of 
the ith group is made up of three independent and additive components: 


(1) an overall mean, p, 
(2) a component common to the group, g;, and 
(3) a component special to the individual, s;;. 


Then 
Jij^HtditsSij, A.4.10 


and since they are deviations from the mean, 
$ gi=0, *Y5;20. A.4.11 
i rea 


For example, the groups might be families. Then g; is the component, 
genetic or environmental or both, common to all members of the ith family 
and s;; is the additional component special to the jth member of the ith 
family. 

Since the group and special factors are independent, the variance is 


V,—- V(yi = V, V,. A.4.12 


The covariance between the measurements, y;; and yj, of two members of 
the same group is 


cov(yi;, Yu) = E((gi + 51) (8i + Six)} Tm 
= E(gi) + E(gisij) + E(Gi Sx) + ElSij Six). E 


The last three terms are all 0, because s;; and s;, are independent of g; and 
of each other. Since E(g2) — V,» 

cov(yij, Yi) = V, A.4.14 
and the correlation between two members of a group is 


V, V. 


9 pet 
V 


r= 
Va + Vy 





A.4.15 


For example, sincc the correlation between mother and daughter for 
independent and additive genes is 1/2, the variance within mother-daughter 
groups, V,, is 1/2 the population variance, V, . 
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If 5;; and s;, are not independent, then A.4.14 is not correct. An example 
would be competition between litter mates, pre- or postnatally. The death 
of one may enhance the probability of survival of others; or if one is stunted 
and eats less, the others may have more. In extreme cases this might create 
negative values of the last term in A.4.13 large enough to offset the first and 
make the covariance negative. For many purposes, however, the assumption 
that the 5;;'s are independent is a reasonable one. 


A.5 Binomial, Poisson, and Normal Distributions 


If the probability of an event is p, the probability that in N independent 
trials the event will occur exactly n times is given by 


prob(n) = p'(1 — p)"^*. A.5.1 


! 
n!(N — n)! 
Because this is the formula for a term in the expansion of [p + (1 — p)]" 
this is called the binomial disbribution. 
The extension to more than two kinds of events is straightforward and 
is given by the multinomial distribution. If p, is the probability of event 1, 
p; the probability of event 2, p, the probability of event 3, and so on, then 


the probability that in N trials event 1 will occur n, times, event 2 n, times, 
event 3 n, times, and so on, is 


N! 
prob(n;, n2, ...) — pi PYP- A.5.2 


n,!nj!n, !- 


Yn -N, Y p=1. 


The theoretical mean and variance of the binomial distribution are 
easily obtained. If p is the probability of an event, then the expected number 
of occurrences in N trials is N p. 

In a single trial there are two possible numbers of occurrences of the 
event, O or 1, with probability (1 — p) and p. Themeannumberof occurrences 
is, of course, p. Thus, by A.2.6 the variance is 


V = p(1— p)? + (1 — pX0 — p»? 
= p(l — p). 


A.6.3 


Therefore, in N trials the variance of the number of occurrences is(by A.2.13) 
N times as large, or 


Vy = Np(1 — p). A.5.4 
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Likewise, from A.2.15, the variance of the proportion of occurrences in N 
trials of an event with probability p is 


V, = aaa A.5.5 

A case of special interest arises when p is allowed to approach 0 at 
the same time that N becomes indefinitely large in such a way that the product 
N p remains of moderate value. The limiting form approached in this manner 
is the Poisson distribution. The probability of exactly n occurrences of the 
event when the mean number is p(= N p) is given by 


n 


e "u 
n! 





prob (n) = A.5.6 
In particular, the probability of no occurrence is e~”. 

The variance formula is easily obtained from formula A.5.4. As p 
approaches 0 while N p = u remains finite, the variance approaches 


V, =H. A.5.7 


That is to say, the variance is the same as the mean for the Poisson distribution. 

The distribution of measurements of many biological materials, and of 
a great many other things, is often approximated by a symmetrical, bell- 
shaped curve such as is shown in Figure A.5.1. This is the curve of the normal 
distribution and has the equation 





1 _(X-2y 
Y 2———e 2s? , A.5.8 
o / 2n 
-30 -—20  -—o o c 2c 3o 
— 
6896 of Area 
No 
96% of Area 


59 795 of Area 


Figure A.5.1. The normal distribution curve. 
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where p is the mean and o? is the variance. The normal distribution is also 
the limit of the binomial distribution as N gets large while p remains finite. 

As can be seen from the figure, approximately 68 % of the observations 
are within one standard deviation of the mean and about 96% are within 
two standard deviations. The proportion of observations lying between any 
two multiples of the standard deviation may be obtained by numerical 
integration of A.5.8 between the appropriate limits. 

If the observations are not normally distributed, it is often possible to 
transform them to yield a distribution that is approximately normal. For 
example, data that are skewed are often rendered roughly normal by replacing 
the numbers with their logarithms or square roots. 

A convenient property of the normal distributionisthat with large samples 
the mean tends to be normally distributed even when the parent distribution 
is not normal. Therefore, if the standard deviation of the mean (often called 
standard error) is known, it is possible to use the normal distribution to give 
an idea of the precision of the estimate. For example, with an appropriately 
drawn sample, the mean of the sample is expected to lie within two standard 
errors of the population mean about 96 % of the time. 


A.6 Significance Tests and Confidence Limits 


1. Significance Tests for Enumeration Data: The Chi-square Test 
We frequently want to compare observed results with those values that are 
predicted on the basis of some hypothesis. How far from the expectation 
must the observations be before they can be regarded as real differences and 
not simply statistical accidents? One widely used approach is to ask the 
question: What is the probability that, if the expectations are correct, 
results would be obtained that deviate from the expectations by as much as 
or more than those which were observed? An approximate answer to this 
question is given by the Chi-square test. 
The value of y? is given by 


(observed — expected)? 
p = y [Prec oen 


expected 
From the value of y? and one other quantity, the number of degrees of 
freedom, the desired probability can be read from a table or chart. A chart 
is given at the end of this chapter. 

The number of degrees of freedom is the number of classes of observa- 
tions minus one, minus the number of parameters estimated from the data. 
To read the chart, look along the lower horizontal axis until the value of y? 
is reached; then go up from this point to the line corresponding to the 
number of degrees of freedom. The probability is directly to the left. 


A.6.1 
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As an example, suppose a plant breeder who expected a 9:3:3:1 
ratio in an F; cross observed 98, 22, 25, and 15 in the four categories. Since 
the total number is 160, the expected numbers on the 9: 3:3: 1 hypothesis 
are 90, 30, 30, and 10. He would compute y? by 


, (98—90? (22-30)? (25—30)? (15 — 10)? 
5-7—99 ' 39 * 3 ` 10 


— 6.18, prob =.11. 


There are three degrees of freedom, indicated in the subscript to y?. 
There are four categories of plants. In this case no parameters were estimated 
from the data, so the number of degrees of freedom is 4 — 1 = 3. 

The probability, 0.11, is obtained from the chart as the probability 
associated with a x? of 6.17 and 3 degrees of freedom. The interpretation 
is that, if one were to repeat this experiment 100 times with 9:3:3:1 
expectations, 11 times he would expect to get results that deviate from the 
expected 90, 30, 30, and 10 by as much as or more than the observed set of 
results did. 

Stated another way, and somewhat more precisely, the probability of 
obtaining a value of y? equal to or greater than that observed, if the hypoth- 
esis is correct, is .1l. This is useful practically because the probability 
given by the x? procedure is a good approximation to that obtained by an 
exact calculation based on the multinomial distribution. 

As a second example, consider the data in Table A.6.1. These are for 
a double backcross in mink where the theoretical expectation with equal 
viabilities and no linkage is 1 : 1 : 1 : 1. The data are from R. M. Shackleford. 

Again, there are 3 degrees of freedom, but this time the value of y? 
is much larger. From the chart it can be seen that the probability is less than 
the smallest value on the chart, .0001. With probabilities as small as this we 
know that (1) either the hypothesis was wrong, or (2) a very improbable 
event has happened. In this case, we conclude that something is wrong with 
the hypothesis; to a geneticist it is obvious that what is wrong is that the 
genes are linked. 

By convention, if the probability given by the chart is less than .05 we 
say that the difference between the observations and the expectations is 
significant. If the probability is less than .01 we say that the difference is 
highly significant. 

The additivity property of y? is illustrated by the further analysis in 
Table A.6.1. The comparison number 2 tests the segregation ratio for the 
E, e locus. Here, x? = 6.05 for 1 degree of freedom. This corresponds to a 
probability a little larger than 0.01. This is significant, but not highly signifi- 
cant; we conclude that the hypothesis of equality is doubtful, probably 
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attributable to viability differences. The comparison number 3 tests the 
segregation ratio of the B,b locus. Here y^ = 0.80, corresponding to a 
probability of about 0.4; there is no reason to doubt the correctness of this 
expectation. Comparison number 4 tests the hypothesis of nonlinkage. 


Table A.6.1. An example of the calculation of x?. 

















OBSERVED EXPECTED (O — EY 
GENOTYPE PHENOTYPE NUMBER (0) NUMBER (E) WE 
Ee Bb Ebony 22 20 .200 
Ee bb Palomino 7 20 8.450 
1 ee Bb Dark 14 20 1.800 
ee bb Pastel 37 20 14.450 
TOTAL 80 80 24.900 = x3 
Ebony + Palomino 29 40 3.025 
2 Dark + Pastel 51 40 3.025 
TOTAL 80 80 6.050 = xi 
Ebony 4- Dark 36 40 .400 
3 Palomino 4- Pastel 44 40 .400 
TOTAL 80 80 .800 = xi 
Ebony + Pastel 59 40 9.025 
4 Palomino + Dark 21 40 9.025 
TOTAL 80 80 18.050 — x1 
TOTAL for 2, 3, and 4 24.900 = x3 





Here there is a large x°, corresponding to a probability of less than .0001. 
So we conclude that definitely the two loci are not independent. 

By this analysis the total x? for 3 degrees of freedom has been broken up 
in 3 components, each with 1 degree of freedom and each testing a separate 
aspect of the observations. The additivity principle is illustrated by the fact 
that these three values, each of 1 degree of freedom, add up to the original 
x’ with 3 degrees of freedom. 
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As a final example, consider the data in Table 2.22 in Chapter 2. In 
this case we have 


MM MN NN TOTAL 
OBSERVED 363 634 282 1279 
EXPECTED 361.5 636.9 280.5 1278.9 


Xi — 0.027, probability — 0.87 


There is no reason to question the assumptions that led to these expectations 
—namely, correct estimation of the gene frequencies, random mating, equal 
viabilities, no segregation bias, etc. This differs from the earlier examples in 
one regard, however. The number of degrees of freedom is not 2, as might 
have been expected, but 1. This is because the data themselves were used to 
estimate the gene frequency on which the expectations are based. Therefore 
one more degree of freedom has been removed; the number of degrees of 
freedom, then, is 3 — 1 — 1 = 1. The manner in which the allele frequency 
was computed is given in Table 2.1.1. 

There are two cautions that should be mentioned. The first is that one 
must use numbers, not proportions or percentages, in the calculations. The 
second is that the method is only approximate, and the degree of approxima- 
tion gets worse as the numbers become small. A conservative rule is not to 
use the y? method if any expected number is less than 5. 


2. Confidence Limits for Enumeration Data A somewhat similar 
problem arises when statistical procedures are used for estimation of some 
population parameter, rather than to test a hypothesis. This is done by deter- 
mining fiducial or confidence limits. (Actually there are subtle differences in 
these two concepts, but they do not enter into the computations for the 
kinds of examples that we are discussing.) 

If the sample is large enough that the normal distribution can be invoked, 
we can assign confidence limits as follows. If p is the estimate and s is the 
standard deviation of the estimate, then the approximate upper and lower 
confidence limits are 

upper limit = L, = p + ts, 

A.6.2 
lower limit = L, = p — ts, 
where t is chosen according to the probability level desired. The value of t 
is obtained from the t chart at the end of this appendix, using infinite degrees 
of freedom and a probability corresponding to the complement of the 
confidence level desired. For 95% “ confidence," we choose t corresponding 
to a probability of .05. 
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For convenience, here are ¢ values corresponding to several frequently 
used confidence levels. 


CONFIDENCE LEVEL t 
.50 0.67 
.90 1.64 
95 1.96 
.98 2.33 
.99 2.58 


Here is a simple numerical illustration. Suppose in a count of 400 persons, 
160 are found to have blue eyes. These are a random sample of a large 
population (theoretically infinite) within which we should like to estimate 
the true frequency of blue eyes. On the basis of the sample, the estimate 
is p = 160/400 2.40. The variance of this proportion, from A.S.5, is 
(.4)(1 — .4)/400 = .0006. The standard deviation, s, is the square root of this, 
or .0245. If we desire a 98% confidence statement we choose t = 2.33. Thus 
the upper limit is approximately 


L, = .40 + (2.33)(.0245) = .457 
and the lower limit is 
L, = .40 — (2.33)(.0245) = .343. 


We therefore have 98% ** confidence" that the value lies between .343 and 
457. 

We can get an improvement in precision, especially when p is far from 
0.5, by asking the following questions: What value of the true proportion 
would lead to a probability « of getting the observed number or something 
larger and what value would lead to a probability « of getting the observed 
number or something less? These two values would then be the limits for the 
confidence value of 1 — 2a. 

Suppose that p is the observed proportion and z is the true value. The 
probability of equalling or exceeding p in a sample of N is gotten by com- 
puting t — (p — z)/s, where s = ./n(1 — 2)/N and looking up the probability 
in a table or chart of the normal integral. The upper limit is gotten the same 
way. Considering both upper and lower limits, we solve the quadratic, 


2_ N(p — n)? 
n(l—m) ' 
or 
(N + n? — QNp + t?)n + Np? =0 A.6.3 


and the two solutions for x are the upper and lower limits. 
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Thus 
AA (2Np + t°) + Np + PP — 4Np?(N 1?) 
n= 2(N + t?) 
L- (2Np + t?) — J/QNp + Y — 4Np?(N + t?) xcu 


2(N + 1?) 


For example, suppose we observe 8 successes in 40 observations. Then 
p = 8/40 = 0.2 and N = 40. If we desire 99% confidence limits, we choose 
t = 2.58, leading to 0.086 and 0.400 as the lower and upper 99 77 limits. If we 
had used equations A.6.2 we would have gotten the less accurate values 
0.037 and 0.363. 

This procedure is only as accurate as the normal approximation to the 
binomial. The exact answers may be obtained by computing the appropriate 
binomial values. Charts giving binomial limits are available in standard 
handbooks (Beyer, 1966: Pearson and Hartley, 1958). From these charts the 
exact limits to the problem above are 0.07 and 0.41, so A.6.4 gave a very 
good approximation. 

The exact meaning of fiducial and confidence limits has been a matter of 
a great deal of controversy. Some have questioned whether it is proper to 
speak of the probability that the * true" value lies between certain limits. 
For an interesting and characteristically polemic discussion of the deeper 
meanings, see Fisher (1956). 


3. Significance Tests for Measurement Data If the data are from 
measurements rather than counts, the appropriate test of significance is 
based on the : distribution. If we have measurements on two groups and 
would like to know if their means are significantly different we compute t£ 
as follows: 


X, - X, 
N,NKN, + N, = 2) 








where X, and X, are the means of the two samples and V, and V, the two 
variances, computed from A.3.1, 
Y (x; - Xy E dion NX? 

N-I N-1 


V= 


N, and N, are the number of measurements in the two samples. 
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This value of t is introduced into the t chart exactly as y? was in the y? 
chart. The number of degrees of freedom is N, + N, — 2. A probability 
less than .05 is interpreted as a significant difference and less than .01 is 
conventionally regarded as highly significant. 

If the two sets of measurements are paired in some way—for example 
if they are measures of blood pressure of the same person before and after 
administration of a drug—it is usually more appropriate to test the hypothesis 
that the two means do not differ by treating each of the differences as a 
variable. If d, = Xa, — X,,, the difference between the first measurements 
in each group, and so on, 


d 
t = ——, A.6.6 
Va 
where 
(di-a? Yd? Na 
V3 = = 


" NN-1) N(N-1)' 


This value of t is entered into the chart with N — 1 degrees of freedom. 
The probability is that of obtaining an absolute value of d as large as or 
larger than the observed value if the population true difference is 0. 


4. The Significance of a Correlation Coefficient When a correlation 
coefficient has been measured and we desire to know whether it is significantly 
different from 0, the t test is again appropriate. In this case 


= r 
B (1— r A.6.7 


(N — 2) 


t 





where r is the observed correlation coefficient and N is the number of pairs 
of measurements. The appropriate number of degrees of freedom is N — 2. 


5. Confidence Limits for the Mean with Measurement Data The 
confidence limits for the mean of a series of measurements are the same as 
A.6.2. The upper and lower limits are 


L, =X + ty, 


= A.6.8 
Lı = X = Sy, 


where X is the observed mean, Sy is the standard deviation of the mean (the 
square root of the variance, as given below, and t is chosen to correspond 
to the probability desired with degrees of freedom equal to N — 1. 
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For example, if the mean of 11 sample items is 25 and the standard 
deviation of the mean is computed to be 4, we might desire the 95 % confidence 
limits. Looking for the value of t corresponding to P = .05 and 10 degrees 
of freedom, we find from the y? and t chart that the value is between 2.2 and 
2.3 (actually the value is 2.23). Thus the 95 % confidence limits for the popula- 
tion mean are 25 + (2.23)(4), or 16.1 and 33.9. 

The t values in the y? and t chart and in the table in Section A.6.2 are 
for two-tailed tests. That is, they give combined probabilities for deviations 
in both directions. If deviations in only one direction are considered, then 
the probability of a deviation as large or larger than that observed is only 
half as large. If confidence intervals are desired where a deviation in only 
one direction makes sense, then choose a t value corresponding to a prob- 
ability twice as large as that given in the chart, e.g., choose ¢ corresponding 
to 0.02 instead of 0.01. 


A.7 Matrices and Determinants 
A matrix, A, is a set of quantities arranged in rectangular form, such as 


Gi; aiz d,3 i4 
421.322, 23. 24. | 4 
43; 055 G33 G34 
G4, 042 G43 Gag 


where the first subscript indicates the row and the second the column. The 
element in row i and column j is thus designated by a;;. The dimensions of a 
matrix are specified by the numbers of rows and columns; a matrix with m 
rows and n columns is called an m x n matrix. 

If two matrices have the same dimensions, the processes of addition and 
subtraction are direct. The corresponding elements are simply added (or 
subtracted). If matrices A and B are added to produce matrix C, then 
€;; 7 djj + bij. For example, 


3 6 1 x 2 —4 li (5 2 2 

210 i3 5 -7 i5 6 -Ir 

The rule for the multiplication of two matrices is more complicated. 
The element in row i and column j of the product matrix is the sum of the 


products of the elements of rowi of the first matrix and column j of the second. 
That is 


Cj — Y aub. A.7.2 
k 
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This makes it necessary that the first matrix have the same number of columns 
as the second has rows. Here is an example: 


f -2 jt ] _ ( 25) 

l 2 0 14 8 3 

The element in the first row and second column of the product matrix is 
(3 x 1) - (-2 x 1) + (6 x 4) 2 25, for example. 

The product of an m x k and a k x n matrix is of dimension m x n. 
Notice that with matrices AB is not necessarily equal to BA. In fact, unless 
special restrictions are placed on the dimensions of A and B one of the 
products may not even be defined. For example if A is Ó x k and B is k x n, 
A x Bis possible, but B x A is not. 

A matrix with only one row is called a row vector; a matrix with only 
one column iscalled a column vector. In population genetics, the most usual 
multiplication is (m x m)(m x 1); that is, a square matrix by a column vector. 
For example, 


v)-0 


Multiplication of a matrix by a constant yields a matrix in which every 
element is multiplied by that number: e.g. 


ka,, kay. kay; 
kA = kay, kd3; ka, . A.7.3 
kd4, ka}, kay, 


Note that this is equivalent to multiplication by the matrix 


k 0 0 
0 k OF}. 
0 0k 
When k = 1 we have the unit or identity matrix, usually designated by I: 


0 
0 
l 


oon 
om © 


0 
0 
o |= I. A.7.4 
] 


0 0 0 


The unit matrix plays the same role in the algebra of matrices as the number 
1 does in ordinary algebra. Notice that multiplication of a square matrix 
by the unit matrix of the same dimensions leads to the original matrix. 
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A square matrix has a corresponding determinant, written as 


41, 0,5 Ay d44 
|A| = G21 422 G23 24 . 

43; 03; 033 034 

G4, 042 Gay Ag 


A.7.5 


The minor of an element in a determinant is the determinant that remains 
when the row and column of the element are deleted. The cofactor of an 
element is the minor, prefixed by a sign which is determined by the position 
of the element. If the sum of the subscripts is even the sign is positive; if the 
sum is odd the sign is negative. We shall designate the cofactor of the element 
a;; by A;;. The value of a determinant is given by taking any row or column 
and summing the product of each element in the row or column by its 
cofactor. Thus, in the determinant above the cofactor of a3, is 


y Qi; 04,3 04,4 

*2 

A3; —(-1) a21 0534 0245 A.7.6 
G4; A43 daa 


and the value of the determinant is 
|A| = ay, Ayy + 421421 + 43145; + QgyAagy- A.7.7 


In this case the first column was used as the basis for expansion, but any 
other row or column could have been; for example 


|A| = 331431 + 432432 + 433453 + 034434. 


Each cofactor may itself be evaluated as a determinant, and so on until the 
cofactors are single elements. 

The inverse of a square matrix is the analog of the reciprocal of a number. 
The inverse of a square matrix A is denoted by A ^! and satisfies the relation 
AA ^! A^! A = I. The process of obtaining the inverse is rather trouble- 
some. To obtain an element, which is designated a", we consider the cor- 
responding element of the original matrix with rows and columns transposed, 
a;,. We take the cofactor of this element and divide by the determinant of 
the entire original matrix. In symbols, the element a" in the inverse matrix 
A^! is given by Aj,/|A| in the original matrix. If the value of the determinant 
is 0, the matrix has no inverse. 

As an example, consider the square matrix, 


3 1 1 
A-12 2 1]. A.7.8 
2 2 3 
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The determinant of this matrix is 


2 1 lod 11 
ess j|? seel | 


=3x4-2xi+2(-—1) =8. 





The cofactors are 





2 1 
A,, 2 (71) 2 3|=4 
2 1 
A=?) 3]--4 





and so on, so the matrix of cofactors is 


4 —4 0 
-1 7 —4]. 
-1 sel 4 


The inverse matrix is obtained by dividing by |A| and interchanging rows 
and columns, 


œl Aa o|- Col 


CO; & olme cole 


A.7.8a 


o 
| 


It can easily be verified in this example that the product of 4 and its inverse 
is the identity matrix; AA! =I. 

Although the procedure given here always works, it is not the best for 
computational efficiency. Much more rapid procedures for large matrices 
are given in textbooks on numerical methods (e.g., Faddeev and Faddeeva, 
1963). 

As an example of the use of matrices in population genetics, consider 
successive generations of self-fertilization. We let D, H, and R stand for the 
proportions of dominant homozygotes, heterozygotes, and recessive homo- 
zygotes. Using the subscript ¢ to indicate time measured in generations we 
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can write the frequencies in generation ¢ in terms of the corresponding fre- 
quencies in the previous generation. With self-fertilization (see Table 3.1.1), 


D, = D,_; + iH,.;, 
H, 4H,_1, A.7.9 
R, 


4H, -1 + Ry... 


This can be written in matrix form 


1 

D, l 1 0 Dixá 
1 

H, |= 0 2 0 Hesri: A.7.10 
1 

R, 0 4 1 R,-1 


Since there is the same relation between the genotype frequencies at 
times t — ] and t — 2 as between those at times rf and t — 1, we can write 


2 


1 
D, 1 4 0 D,~2 
1 
H, |= 9 o U H,-> 
1 
R, 0 5 1 R,-2 
i t A.7.11 
l z- Do 
1 
= 0 2 0 Ho 
1 
0 " 1 Ro 
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If A is the matrix in A.7.11, then by the rule for matrix multiplication 


3 7 

1 = 0 1 Eu 
8 16 9 

1 1 

A? = 0 = A? = is 
" 0 , 0 : 0 

3 7 

0 = 1 0 Ll 
8 16 ; 


As a numerical example, consider a population in which the initial 
frequencies of the three types are Do = 1/4, Hy = 1/2, and Ro = 1/4. What 
are the frequencies after three generations of self-fertilization ? This is given by 


7 1 
D 1 m 0 = 
> 16 4 
1 1 
H, | = 0 - 0 -l 
3 8 2 A.7.12 
7 1 
aes 1 s 
iL 9 16 4 
from which we can write 
1 15 
4 16 2 32 
1 | 1 
H, = 8 x 2- 16’ A.7.13 
ick 7 ues! 1 ] 15 
3716 2* *473» 


These are the same values obtained by another method in Table 3.1.1. 


A.8 Eigenvalues and Eigenvectors 


We would like to have a way to obtain quantities after several generations 
without the necessity for repeated matrix multiplication. This can often be 
done. Consider the recurrence relations 

X, Ay yXp—y + Ayr Ve-1 + 2320-15 

Ve = d21X,-, + 033 Ja- 1 t 0532Z4- 5, A.8.1 


Z, = d341X,-, + d35 Ya- 1 + 0332i- 17) 
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where, as before, the subscript ¢ measures time in generations. In matrix 
form 


X, Ay, d, Aya) Xii 
Y| S {8z 82 alb ¥r-1 |- A.8.2 
Zi 43, G32 d33/ MZ.-1 


We consider the related quantities /, m, and n, chosen so that 


dj, d dÍ! I 
Ay, d55 à554|| my =Al m]. A.8.3 
G3, G32 d33/ M n 


The column vector is called an eigenvector. Equations A.8.3 may be written 
a, + Ayam + a,4n = 2l, 
ıl + d43 m + d;4n = Àm, A.8.4 
a,l + az M + 0334n = An, 

which is the same as 
ıı! + (a3; —2)m + aa n= 0 A.8.48 
a,l + azam + (a3, = A)n = 0. 


From the rules of algebra, we know that this system of homogeneous 
equations has a nontrivial solution for /, m, and n if and only if 


A — À d,; di3 
a2 d3;—À a3, |=9, A.8.5 
a3, 442 d33 — À 


or, in abbreviated form, 
[A — AI| — 0, A.8.58 


where I is the identity matrix. 

A.8.5a is the characteristic equation of the matrix A and has three roots. 
We assume that the three roots, 4,, 45, and A;, are distinct; that is, that no 
two have the same value. These roots are called eigenvalues (also character- 
istic roots, latent roots, characteristic values, and proper values). When 4, 
is substituted into À.8.4a, the equations can be solved for the ratios /,, m;, 
and n,. Likewise, /; , m}, and n; are obtained by substituting 4, ; and so on 
for /, m3, and n3. 

We note here that /, m, and n are not uniquely determined, since (for 
example) we could divide each equation by n and have three equations in 
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two variables, //n and mjn. However, the ratio /: m: n can be uniquely 
determined if condition A.8.5 is met; so one of the three can be chosen 


arbitrarily. 


The relation between x,_, and x,_, is the same as that between x, and 


x,-,. Therefore, from A.8.2, we can write 


2 
X, a, 01 4,5 X12 
Yı | = [421 422 423 yi-2 
Zi 43, G32 433 21-2 


A.8.6 
Gi, i2 i3 

—|[d;, 422 423] | Yo}- 
a31 432 433 Zo 


The initial values x9, yo, and z are known, and are treated as constants. 


They can be expressed in terms of the /, m, and n's by 


Xo — cl t cjl, +c l, 
Yo = CıM, + CaM, + ems, 


Zo = Cyn, + 4n, + C3 N3. 


A.8.7 


If the three eigenvalues are distinct, these equations can be solved and 
the c's thereby determined. Then a general solution is possible. Substituting 


A.8.7 into A.8.6 gives 


X, dij di y3\'f ly b 0l; tch 
yu|-2|d dn 4234 | Cx t c2» + 63m 
Z, 434. 432 033 Cini + C272 t C373 
A.8.8 
= c4 A'[ m, | c; A'[ m; |  eA'[ ms . 
ni n) ny 
From A.8.3 
l; l; 
A mi| = A; mi |. 
n, n; 


Li i 


Notice that, by the rules of matrix multiplication, if x is a column vector 


A?x = A(Ax) = A(Ax) = A(Ax) = 4?x. 


Continuing this, 
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Substituting in A.8.8, we obtain 


Xi l l l, 

Li t t 
y = eam [c4 M: | + c343| ms |, 
Z, n, n; nj 


which may be written as 
X, = cya, + C5 l A5 + Cy 1, A5, 
y =Cym,A +c mM, 25 + c4 m4 A5, A.8.9 


Zt => c,n441 + C5 n, Àh + C3 n; A5, 


where the c's, /’s, m’s, n’s, and 4's are all constants which have been deter- 
mined or can be determined. 

This gives us an explicit expression for x, y, and z in any generation 
without requiring a series of matrix multiplications. 

One of the eigenvalues, say 4,, will be the largest and, as ¢ gets large, 
Àj is much larger than 44 and 45. Thus after a few generations the structure 
of the population will be determined entirely by the largest or dominant 
eigenvalue. Hence, for many purposes, only the largest root need be found. 

In this book we are mainly concerned with the cases in which the elements 

of a matrix are probabilities, which are necessarily non-negative. Therefore 
the largest eigenvalue, 7,, is positive. Furthermore, the absolute value of any 
eigenvalue is not greater than one. 

As the other roots become small, we see from A.8.9 that 


X: Yiz >l omy i ny A.8.10 


as t — oo. This tells us that the frequencies of the three types represented by 
x, y, and z eventually reach a constant ratio measured by the values of the 
largest eigenvector corresponding to the largest eigenvalue. 

If some of the roots are repeated there may not be a corresponding 
number of independent eigenvectors, but this situation does not arise in the 
examples that we are considering. 

As a numerical example, consider the recurrence relations for repeated 
sib mating as given in equations 3.8.6. 


h, = kı- 1s 
k, = th,-, F tk,_1, 
where the quantities are defined in Section 3.8. The quantity of greatest 


interest is h,, the relative heterozygosity at generation f. 
The characteristic equation is 


E 1 
i$d-A4 


A.8.11 


=) A.8.12 





APPENDIX 509 


On expansion of the determinant, we have 
A —34A4— 3-0, 


giving the roots 


5 
i mee = .809, 





E A.8.13 
4, = : 2 = —.309. 

The general solution for A, is given by 

h, = a4, + bâs, A.8.14 


where a and b can be determined by the value of A for the first two generations. 
Putting t = 0 and t = 1, we obtain respectively 


ho =at b, 
h, = aà, + bà, D 
These can be solved for a and b, giving 
i A, —A, i 
pa taa 
Ài— 4A 


A.8.15 


From the definition, A, is the amount of heterozygosity expressed as 
a fraction of the original amount. If we start with a randomly mating popula- 
tion, ho = ^, = 1. For example, h, = 5/8. 

Notice that when t becomes large 


h, % A, B, , — 0.809h,. ,. A.8.16 


Thus, after a few generations the heterozygosity decreases by a fraction 0.191 
each generation. 


A.9 The Method of Maximum Likelihood 


The principle of this method is to choose as the estimate of the parameter 
the value that maximizes the probability of the observed results. In addition 
to its intuitive appeal, this method has been shown by Fisher to be an opti- 
mum procedure by several criteria (Fisher 1958). 
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As an illustration consider a problem where the answer is already 
known from other considerations, the estimation of gene frequency in a 
2-allele locus with the heterozygote recognizable. We assume random mating. 


GENOTYPE AA Aa aa TOTAL 
EXPECTED PROPORTION p? 2p(1 — p) (1 — p)? 1 
OBSERVED NUMBER A B C N 


The probability of the observed numbers, A, B, and C, as a function of 
the unknown parameter, p, is 


N! 
prob = ATBICI p?4(2p(1 — pA — py?* 
Try A.9.1 
NE N! B 24 * B, Br2C 
simwa? GP 


The same value of p that maximizes the probability will also maximize 
the logarithm of the probability and the algebra is thereby greatly simplified. 


L = log prob = (2A + B) log p + (B + 2C) log (1 — p) + K, A.9.2 


where K is a constant. 
To find the value of p that maximizes L, we equate the derivative of L 
with respect to p to 0. 





curs EN ci ai MN 0, 
dp p i—p 
which has the solution 
B 24 +B 
TUM A.9.3 


as expected. 

The theoretical variance of a maximum-likelihood estimate appropriate 
when the sample is large is given by the estimated value of the negative 
reciprocal of the second derivative 


1 d?L 
V. = -E( 53): A.9.4 


p 


‘ 


The estimated value symbol “E” is taken to mean 
quantities with their maximum-likelihood estimates.” 


‘replace the observed 
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Replacing (24 + B) by 2Np and (B + 2C) by 2N(1 — p), we obtain 


1 — 
V,= ee A.9.5 


which is as expected, for this is equivalent to a binomial sample of 2N genes. 

When more than one parameter is to be estimated, as with multiple 
alleles, the procedure is a straightforward extension, though naturally more 
complicated. Consider the 3-allele model with dominance in Table A.9.1. 


Table A.9.1. A 3-allele model with dominance. A, is dominant to A, and 43, 
while A, is dominant to 43. 


PHENOTYPE A; A2 Az; TOTAL 
GENOTYPES Adı, Aida, A145 A2A2, A2A3 Az A3 
OBSERVED NUMBERS A B C N 
EXPECTED PROPORTIONS Pit 2pipai + 2pips pi + 2pips p3 l 


To simplify the calculations we make use of a very convenient property 
of maximum-likelihood estimates, that of functional invariance. This means 
that any function of a maximum-likelihood estimate is the maximum-likeli- 
hood estimate of that function. Therefore, in this example, we let 


y= P, x = (pa + P3). A.9.6 


Inversely, 


p—- JY  m-Wx- M» pal- yx. A.9.6a 


Then the expected proportions for the phenotypes 4,, A2, and A, are l — x, 
x — y, and y. The probability of the observed numbers in terms of x and y is 


AVBICI (1 = xy*(x = y)*y* A.9.7 


prob — 


and, taking logarithms as before, 


L = Alog(l — x) + Blog(x — y)  Clog y + K, A.9.8 
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where K is a constant. 


OL | —A B 











pisc = 0, 9.9 
Ox DB 5 ü 
oL —B C 

— = +—=0. A.9.9a 
dy x-y y 


Solving for x and y, we obtain 


C _B+C 


y= Wm E s A.9.10 


and, by the principle of functional invariance (see A.9.6a), 


_ [C e "E m (ee fae 
P3 = N’ P2 7 N N° Di = N ‘ 9. 


The same principle applies to more involved cases. The difficulty is that 
the likelihood equations are often impossible to solve except by successive 
approximations. The formulae for the variances are also more involved, but 
straightforward. We shall give the arithmetic procedures later in this section. 

One more very useful property of maximum-likelihood estimates (and 
many other large sample estimates as well) is that if X is a quantity whose 
variance is known and Y is a function of X, the variance of Y is given by 








dY 


2 
Vy = Vy (5) y A.9.12 


For example, if there are 7 people with genotype aa and N — n with 
genotype AA or Aa, then n/N is an estimate of p°, where p is the frequency of 
the a allele. This is the maximum-likelihood estimate. The variance of the 
estimate of p? is p?(1 — p?)/N. Then, by the principle of functional invariance, 


the estimate of the gene frequency p is JnN. The variance of this estimate is 


cL C 








N N |2p 4N iiie 
If three variables are involved, and Z is a function of X and Y, 
V. V (Z) + u(Z) +20 d 
= = — VxY => =: 9. 
dii o. "loy xY 3X OY Am 
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In order to get the variance of the estimates when more than one param- 
eter is being estimated, we make use of matrix methods. We first define the 
information matrix as 


I b 
"mo SI-A, A.9.15 
( xy ly 
where 
8L 
La = —E(— , 
(52) 


ð L 
fay = -2(= z) 


OL 
I, = -2(53) 


In each case the maximum-likelihood estimates are substituted into the 
formulae. From the information matrix the variances can be found. 

The extension to more than two parameters is straightforward, but will 
not be given here. 

Consider again the example of Table A.9.1. Further differentiation of 
A.9.9 and A.9.9a, followed by substitution of the maximum-likelihood 
estimates from A.9.10 (and recalling that A + B + C = N), leads to 


| 


, A. BO (4P) 
Mo qx) Hy AB }? 
i B N? 
7 XL yy? == B A.9.16 
B C B4C 
Eu spem = N?| — —— |. 
I,, TETE (5) 


The inverse of the information matrix is the rariance-covariance or 
covariance matrix. 


fn UE A AS A.9.17 
COV yy Vy l i 


In this case 





A+B 1 
AB B 
LN 
A=N 1 B+C? A.9.18 
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which inverts to 


_, 1 (A(B+C) AC 
lc ] 
eTo N? ( AC C(A + gj 


A.9.19 
Thus 

A(B + C) 

MET 
C(A + B) 
y CN C A.9.20 
AC 
cov "S " 


To obtain the variance of the gene frequency estimates we again use 
formulae A.9.12 and A.9.14. The estimates again are 


C 

py—- / y - JS. 
B+C 

p=l1-,/x ad às ’ 


ns JOE f 


Using these, along with A.9.6 and A.9.10, we obtain 
v. -w(ny AF) A A+B atap 














ay N? 4y 4N? AN C 
Vous v (ny AO VON. belong 
pı ôx) N? 4x NT AN C 
Op, M? Op2 0 
"D (ey +v, (=) + 2 cov,, UP2 0P2 A.9.21 
ôx Üy Ox ey 
| A(B*C)1 | C(A-B)! „AC | 
N dx ^ NS dy NS A xy 





(24+ 5- 24] E | 
~ AN? B+C 


EIS L-(1— py pll- (1 — p] 
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A.10 Lagrange Multipliers 


In maximum or minimum problems it is often required to find a stationary 
value (maximum or minimum) subject to certain side conditions. The 
Lagrange method of undetermined multipliers often effects a great simplifica- 
tion in the algebra. We shall not attempt to explain why the procedure works, 
but will illustrate its use. 

Suppose there is a function f(x, y, z, ...) in which there are k relations 
among the variables, ¢,(x, y, z, ...) 20, $2 2 0,..., $9, — O. To find the 
values of x, y, z, ... which maximize or minimize f, we equate to 0 the partial 
derivatives of the function 


V =f+ Arby 42023 7 + Andy, A.10.1 


where the A,’s are treated as constants. In taking partial derivatives we treat 
X, Y, Z,..., as if they were independent. 

For example, suppose we wish to find the rectangle of maximum area 
inscribed in a circle of radius r. The equation of the circle is x? + y? = r?. 
The area of the rectangle is 4xy. So we write 


W(x, y) = 4xy + A(x? + y? — r°). 


Differentiating, 
oy = 4y + 2Ax = 0, 
Ox 
ôy =4x+2Ay=0. 
dy 


Solving these two equations, A = —2 and x = y; so the figure is a square. 

Of course we could have substituted ./r? — x? for y in 4xy and differen- 
tiated with respect to x only. In this problem the saving of algebra is trivial, 
but in many problems this procedure effects a great simplification. 

To consider a genetic example, what frequency of each of n alleles will 
maximize the proportion of heterozygotes with random mating? It is easier 
to ask what minimizes the proportion of homozygotes. So we ask what 
values of the p;'s minimize )' p? subject to the side condition that Y' p, = 1. 
We write 


y —Y pi * AQ pi — 1), 
ow 
—=2p7,+A=0. 
OP; 
If we add all k equations, 2 È p; + kA =0; but since Y p, = 1, A = —2/k. 
So p; = 1/k for all k alleles, and the heterozygosity is maximized when the 
alleles are equally frequent. 
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his list of articles and books includes many that are not referred to in the 

text; in fact, the majority are in this category. There are also many that we 

have not studied, but we thought it would be more useful to have a list that 
is extensive rather than selective. We hope, by so doing, to call attention to the rich- 
ness and diversity of the literature in this field. 

We have not tried to make the list exhaustive, but we have attempted to include 
most of the papers in the theory of population genetics, particularly those that are 
in English and which have appeared in readily available journals. Mostly these 
papers deal with theoretical and mathematical aspects, but we have included several 
experimental and observational studies, particularly if these include or bear on 
theoretical points. 


Adke, S. R. 1964. A multi-dimensional birth and death process. Biometrics 20: 
212-216. 

Ali, M., and H. H. Hadley. 1955. Theoretical proportion of heterozygosity in 
populations with various proportions of self- and cross-fertilization. Agron. J. 
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variance of the gene frequency charge in 
one generation 

variance in progeny number 

mutation rate, in reverse direction to 
that measured by u 

viability of genotype 4,4; 

reproductive value at age x 

fitness in discrete generation model 

fitness of genotype 4,4; 

fitness of allele 4; 

age 

allele frequency varying stocbastically 

phenotypic value 

phenotypic value at f — 0 

phenotypic value at f— 1 

PıP4/ popa, a measure of linkage disequi- 
librium 


proportion of double reduction 
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